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1 

RECOMBINANT EXPRESSION VECTORS AND 
PURMCATION METHODS FOR THERMITS THERMOPHH J TS 

DNA POLYMERASE 

. The present invention relates to a purified, thermostable DNA polymerase 
5 purified from Thermus thenmophilus and recombinant means for producing the 
enzyme. Thermostable DNA polymerases are useful in many recombinant DNA 
techniques, especially nucleic acid amplification by the polymerase chain reaction 
(PGR). 

Extensive research has been conducted on the isolation of DNA polymerases 
10 firom mesophilic microorganisms such as £. coli- See, for example. Bessman £i al., 
1957, 1 JBid. Chfim. 221:171-177 and Buttin and Komberg, 1966, 1 Biol. Chem . 
2^:5419-5427. 

Much less investigation has been made on Uie isolation and purification of DNA 
polymerases ftom tiieimophiles such as Thermus thermophilus . Kaledin ei al., 1980, 

15 PiolJiymiya 45:644-65 1 disclose a six-step isolation and purification procedure of 
DNA polymerase from cells of X aguaricus YT-1 strain. These steps involve isolation 
of crude extract, DEAE-cellulose chromatography, fiactionation on iiydroxyapatite, 
fiactionation on DEAE-cellulose, and chromatography on single-strand DNA-cellulose. 
The pools firom each stage were not screened for contaminating endo- and 

20 exonuclease(s). The molecular weight of tiie purified enzyme is reponed as 62,000 
daltons per monomeric unit. 

A second purification scheme for a polymerase from Thermus aguaricus is 
described by Chien £l al- 1976, 1 Bacieclfll. 122:1550-1557. In tiiis process, the 
cmde extract is applied to a DEAE-Sephadex column. The dialyzed pooled fractions 

25 are tiien subjected to treatment on a phosphocellulose column. The pooled fractions are 
dialyzed and bovine serum albumin (BSA) is added to prevent loss of polymerase 
activity. The resultingmixture is loaded on a DNA-cellulose column. The pooled 
material fiom the column is dialyzed and analyzed by gel filtration to have a molecular 
weight of about 63,000 daltons and by sucrose gradient centrifugation of about 68,000 

30 daltons. 

The use of thermostable enzymes, such as those prepared by Chien sa M • and 
Kaledin £i a]., to amplify existing nucleic acid sequences in amounts diat are large 
compared to the amount initially present was described in U.S. Patent Nos. 4,683,195; 
^ 4,683,202; and 4,965,188, which describe tiiePCR process. Primere, template, 
35 nucleoside triphosphates, the appropriate buffer and reaction conditions, and a 

polymerase are used in die PGR process, which involves denaturation of target DNA, 
hybridization of primers, and synthesis of complementary strands. The extension 
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product of each primer becomes a template for the production of the desired nucleic acid 
sequence. The patents disclose that, if the polymerase employed is a thermostable 
enzyme, then polymerase need not be added after every dehaturation step, because heat 
will not destroy the polymerase activity. 

5 European Patent Publication No. 258,017; PCX Publication No. 89/0669 1 ; and 

U.S. Patent No. 4,889,818 describe the isolation and recombinant expression of an 
-94 kDa thcraiostable DNA polym^asc from Ihsnmis aoiiatiaiS and tiie use of that 
polymerase in PGR, Although I, aquaticus DNA polymerase is especially preferred 
for use in PGR and other recombinant DNA techniques, there remains a need for other 

10 thermostable polymerases. 

Accordingly, there is a desire in the art to produce a purified, thermostable DNA 
polymerase that may be used to improve the PGR process described above and to 
improve the results obtained when using a thermostable DNA polymerase in other 
recombinant techniques such as DNA sequencing, nick-translation, and even reverse 

15 transcription. The present invention helps meet tiiat need by providing recombinant 
expression vectors and purification protocols for Thermus thermophilus DNA 
polymerase. 

Accordingly, the present invention provides a purified thermostable enzyme that 
catalyzes combination of nucleotide triphosphates to form a nucleic acid strand 

20 complementary to a nucleic acid template strand The purified enzyme is the DNA 
polymerase from Thermus thermophilus (Ttii) and has a molecular weight predicted 
firom the nucleic acid sequence of the gene of about 94 kDa, This purified material 
may be used in a temperature-cycling amplification reaction wherein nucleic acid 
sequences are produced from a given nucleic acid sequence in amounts that are large 

25 compared to the amount initially present so that the sequences can be manipulated 
and/or analyzed easily. 

The gene encoding Ttii DNA polymerase enzyme finom Thermus themiophilvs 
has also been identified and cloned and provides yet another means to prepare tiie 
thermostable enzyme of tiie present invention. In addition to the gene encoding the Tth 

30 enzyme, gene derivatives encoding Tth DNA polymerase activity are also provided. 
The invention also encompasses a stable enzyme composition comprising a 
purified, tiiermostable Ttii enzyme as described above in a buffer containing one or 
more non-ionic polymeric detergents. 

Finally, the invention provides a method of purification for the thermostable 

35 polymerase of the invention. This metiiod involves preparing a crude extract firom. 
Thenmus tiiermophilus cells, adjusting the ionic strength of tiie crude extract so tiiat the 
DNA polymerase dissociates from nucleic acid in the extract, subjecting the extract to 
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hydrophobic interaction chromatography, subjecting the extract to DNA binding protein 
affinity chromatography, and subjecting the extract to cation or anion exchange or 
hydroxyapatite chromatography. In a prefeired embodiment, these steps are carried out 
sequentially in the order given above, and non-ionic detergent is added to Uie extract 

5 prior to the DNA binding protein affinity chromatography step. The nucleotide binding 
protein affinity chromatography step is prefeired for separating the DNA polymerase 
from endonuclease proteins. 

The present invention provides DNA sequences and expression vectors that 
encode Tth DNA polymaase. To facilitate understanding of the invention, a number of 
10 terms are defined below; 

The terms "cell," "cell line," and "cell culture" can be used interchangeably and 
all such designations include progeny. Thys, the words "transformants" or 
"transformed cells" include Uie primary transformed cell and cultures derived from that 
cell without regard to the number of transfers. All progeny may not be precisely 
15 identical in DNA content, due to deliberate or inadvenent mutations. Mutant progeny 
that have the same functionality as screened for in the originally transformed cell are 
included in the definition of transformants. 

The term "control sequences" refers to DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. The 
20 control sequences tiiat are suitable for procaryotes, for example, include a promoter, 
optionally an operator sequence, a ribosome binding site, and possibly other 
sequences. Eucar>'otic cells are known to utilize promoters, polyadenylation signals, 
and enhancers. 

The term "expression system" refers to DNA sequences containing a desired 
25 coding sequence and control sequences in operable linkage, so Uiat hosts transformed 
witii tiicse sequences are capable of producing tiie encoded proteins. To effect 
transformation, the expression system may be included on a vector, however, the 
relevant DNA may also be integrated into the host chromosome. 

The term "gene" refers to a DNA sequence that encodes a recoverable bioactive 
30 polypeptide or precursor. The polypeptide can be encoded by a full-length gene 

sequence or by any portion of the coding sequence so long as tiie enzymatic activity is 
retained. 

The term "operably linked" refers to the positioning of the coding sequence 
such that control sequences will function to drive expression of die protein encoded by 
35 the coding sequence. Thiis, a coding sequence "operably linked" to control sequences 
refers to a configuration wherein the coding sequences can be expressed iinder the 
control of a control sequence. 
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The tenn "mixture" as it relates to mixtures containing Tth polymerase refers to 
a collection of materials which includes Tth polymerase but which can also include 
other proteins. If the Tth polymerase is derived firom recombinant host cells, the other 
proteins will ordinarily be those associated with the host Where the host is bacterial, 

5 the contaminating proteins will, of course, be bacterial proteins. 

The term "non-ionic polymeric detergents" refers to surface-active agents that 
have no ionic charge and that are characterized, for purposes of this invention, by an 
ability to stabilize the Tth enzyme at a pH range of firom about 3,5 to about 9.5, 
preferably from 4 to 8.5. 

10 The term "oligonucleotide" as used herein is defined as a molecule comprised of 

two or more deoxyribonucleotides or ribonucleotides, preferably more than three, and 
usually more than ten. The exact size will depend on many factors, which in turn 
depends on the ultimate function or use of the oligonucleotide. The oligonucleotide 
may be derived synthetically or by cloning. 

15 The term "primer" as used herein refers to an oligonucleotide, whether 

occurring naturally as in a purified restriction digest or produced synthetically, which is 
capable of acting as a point of initiation of synthesis when placed under conditions in 
which synthesis of a primer extension product which is complementary to a nucleic acid 
strand is initiated, i.e., in the presence of four dififCTent nucleoside triphosphates and 

20 the Tth thermostable enzyme in an appropriate buffer ("buffer" includes pH, ionic 

strength, cofactors, etc.) and at a suitable temperamre. For Tth polymerase, the buffer 
preferably contains 1 to 3 mM of a magnesium salt, preferably MgCli, 50-200 of 
each nucleotide, and 0,5 to 1 ^iM of each primer, along with 50 mM KCl, 10 mM Tris 
buffer, pH 8-8.4, and 100 jig/ml gelatin (altiiough gelatin is not required and should be 

25 avoided in some applications, such as DNA sequencing). 

The primer is single-stranded for maximum efficiency in amplification, but may 
alternatively be double-stranded. If double-stranded, tiie primer is first treated to 
separate its strands before being used to prepare extension products. The primer is 
usually an oligodeoxyribonucleoride. The primer must be sufficientiy long to prime the 

30 synthesis of extension products in the presence of the polymerase enzyme. The exact 
length of a primer will depend on many factors, such as source of prinier and result 
desired, and the reaction temperamre must be adjusted depending on primer length to 
ensure proper annealing of primer to template. Depending on the complexity of the 
target sequence, the oligonucleotide primer typically contains 15 to 35 nucleotides. 

35 Short primer molecules generally require cooler temperatures to form sufficientiy stable 
complexes with template. 
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A primer is selected to be "substantially" complementaiy to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize 
with a template strand for primer elongation to occur. A primer sequence need not 
reflect the exact sequence of the template. For example, a non-complementaiy 
5 nucleotide fragment may be attached to the 5' end of die primer, with the remainder of 
the primer sequence being substantially complementary to the strand. Non- 
complementaiy bases or longer sequences can be interspersed into the primer, provided 
that the primer sequence has sufficient completnentarity with the sequence of the 
template to hybridize and thereby forai a template primer complex for synthesis of the 
10 extension product of the primer. 

The terms "restriction endonucleases" and "restriction enzymes" refer to 
bacterial en2ymes which cut double-stranded DNA at or near a specific nucleotide 
sequence. 

The terai "thermostable enzyme" refers to an enzyme which is stable to heat and 
15 is heat resistant and catalyzes (facilitates) combination of die nucleotides in the proper 
manner to fonn primer extension products tiiat arc complementary to each nucleic acid 
strand. Generally, syntiiesis of a primer extension product begins at die 3* end of the 
primer and proceeds in die 5' direction along die template strand, until syntiiesis 
terminates. 

20 The Ttii tiieraaostable enzyme of die present invention satisfies tiie requirements 

for effective use in die amplification reaction known as die polymerase chain reaction. 
The Tdi enzyme does not become irreversibly denatured (inactivated) when subjected to 
die elevated temperatures for die time necessary to effect denaniration of double- 
stranded nucleic acids, a key step in die PGR process. Iircvereible denaturation for 

25 purposes herein refers to peraaanent and complete loss of enzymatic activity. The 
heating conditions necessary for nucleic acid denaturation will depend, e.g., on die 
buffer salt concentration and die composition and lengdi of die nucleic acids being 
denatured, but typically range from about 90 to about 105*C for a time depending 
mainly on die temperanirc and die nucleic acid lengdi, typically from a few seconds up 

3Q to four minutes. Higher tempcranires may be tolerated as die buffer salt concentration 
and/or GC composition of die nucleic acid is increased. The Tdi enzyme does not 
become irreversibly denatured for relatively short exposures to temperanires of about 
90-100'C. 

The Tdi diermostablei enzyme has an optimum temperamre at which it functions 
35 diat is higher dian about 50'C. Temperatures below 50'C faciUtate hybridization of 
primer to template, but depending on salt composition and concentration and primer 
composition and lengdi, hybridization of primer to template can occur at higher 
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teii5)eraturcs (c.g., 45-70T), which may promote specificity of the primer elongation 
reaction. The higher the temperature optimum for the enzyme, the greater the 
spedfidtyancVor selectivity of the primer^iirectcd extension process. The optimum 
temperature for Tth activity ranges from about 50 to 90*C 

5 The present invention provides the DNA sequence encodmg a full-length 

tiiermostable DNA polymerase of Thermus thermophilus . This DNA sequence and tiie 
deduced amino add sequence are depicted below. For convenience, the amino add 
sequence of this Tth polymerase is numbered for reference, and other forms of the 
thermostable enzyme are designated by referring to changes fiom Uie full length, native 

10 sequence. 
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The DNA and amino add sequences shown above and the DNA compounds 
that encode those sequences can be used to design and construct recombinant DNA 
expression vectors to drive expression of Tth DNA polymerase activity in a wide 
variety of host cells. A DNA compound encoding all or part of the DNA sequence 

5 shown above can also be used as a probe to identify thermostable polymerase-encoding 
DNA from other organisms, and the amino acid sequence shown above can be used to 
design peptides for use as immunogens to prepare antibodies that can be used to 
identify and purify a thermostable polymerase. 

Whether produced by recombinant vectors that encode the above amino acid 

10 sequence or by native Thermus thcrmophilus cells, however, Tth DNA polymerase will 
typically be purified prior to use in a recombinant DNA technique. The present 
invention provides such purificarion methodology. For recovering the native protein 
the cells are grown using any suitable technique. Briefly, the cells are grown on a 
medium, in one liter, of nitrilotriaceric acid (100 mg), tryptone (3 g), yeast extract (3 

15 g)» succinic acid (5 g), sodium sulfite (50 mg), riboflavin (1 mg), K2HPO4 (522 mg), 
MgS04 (480 mg), CaCli (222 mg), NaCl (20 mg), and trace elements. The pH of the 
medium is adjusted to 8.0 ± 0.2 with KOH. The yield is increased up to 20 g of 
cells/liter if cultivated with vigorous aeration at a temperature of 70"C. Cells in the late 
logarithmic growth stage (determined by absorbance at 550 nm) are collected by 

20 centrifiigation, washed with a buffer and stored frozen at -20'C. 

In another method for growing the cells, a defined mineral salts medium 
containing 0.3% glutamic acid supplemented with 0.1 mg/1 biotin, 0.1 mg/1 thiamine, 
and 0.05 mg/1 nicotinic acid is employed The salts include nitrilotriacetic acid, CaS04, 
MgS04, NaCl, KNO3, NaNOs. ZnS04, H3BO3, CUSO4, NaMo04. C0CI2, FeCb, 

25 MnS04, and Na2HP04. The pH of tiie medium is adjusted to 8.0 witii NaOH. The 
cells are grown initially at 75*C in a water bath shaker. On reaching a certain density, 
one liter of these cells is transferred to a 14-liter fermentor. Sterile air is bubbled 
through the cultures and the temperature maintained at 75*C. The cells are allowed to 
grow for eight hours before being collected by centrif ugation. 

30 After cell grov/th, the isolation and purification of the enzyme takes place in six 

stages, each of which is carried out at a temperature below room temperature, 
preferably about 4*C. In the first stage or step, the cells, if frozen, are thawed, 
disintegrated with an Aminco ftench pressure cell (1 8,000 psi), suspended in a buffer 
at about pH 7.5, and centrifuged. In the second stage, the supernatant is collected and 

35 then fractionated by adding a salt such as dry ammonium sulfate and Polymin P to 
remove nucleic acids. The pellet (at 0.2 M NH4SO4) is discarded. 
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The supernatant from the second stage is applied to a phenyl sepharose column 
equilibrated with a buffer composed of 0.2 M (NH4)2S04, 50 mM Tris-HCl, pH 7.5, 
and 0.5 mM DTT. Then the column is washed°first with buffer ( 1 ): IE buffer 
containing 0.5 mM DTT and 0.2 M (NH4)2S04, then with buffer (2): TE buffer 
5 containing 8.5 mM DTT, then with buffer (3): buffer (2) containing 20% ethylene 
glycol. The protein is eluted in buffer (4): buffer (3) buffer containing 2 M urea. 

In the fourth step, the eluate collected in the third step is applied to a heparin 
sepharose column equilibrated with 0. 1 5 m Kd. The column is then washed in the 
same buffer and the enzyme eluted with a linear gradient of a buffer such as 0. 1 5 M to 
10 0.75 KCl. The activity peak is at 0.3 1 to 0.355 M KCl. 

In the fifth stage, the fraction collected in the fourth step is concentrated and 
diafiltered against Aftigel-blue buffer. The precipitate foraied is removed by 
centrifugation, and the supernatant is applied to an Affigel-blue column equilibrated 
with 0.1 M KCl. The column is then washed with 0.1 M KCl and the enzyme eluted 

15 with a linear gradient of a buffer such as 0.1 to 0..5 M KCl. Fractions with 

thcraapstable enzyme activity arc then tested for contaminating deoxyribonucleases 
(endo- and exonucleases) using any suitable procedure. For example, the endonuclease 
activity may be determined electrophoretically from the change in molecular weight of 
phage X DNA or supercoiled plasmid DNA after incubation with an excess of DNA 

20 polymerase. Similarly, exonuclease activity may be deteraiined electrophoretically 
from the change in molecular weight of DNA after treatment with a restriction enzyme 
that cleaves at several sites. The fractions determined to have no deoxyribonuclease 
activity (peak activity of pol>'merase elutes at 0.28 to 0.455 M KCl) are pooled and 
dialyzed against CM-Trisacryl buffer. The precipitate foraied is removed by 

25 centrifugation. . 

In the sixth step, the supernatant is applied to a CM-Trisacryl column 
equilibrated with 50 mM NaCl. The column is washed witii 50 mM NaCl and the 
enzyme eluted with a linear gradient of a buffer such as 0.05 to 0.4 M NaCl. The 
pooled fractions having thermostable polymerase activity and no deoxyribonuclease 

30 activity elute at 0.16 to 0.20 M NaCl. 

The molecular weight of the dialyzed prcxluct may be determined by any 
technique, for example, by SDS-PAGE analysis using protein molecular weight 
maikers. The molecular weight of the DNA polymerase purified from Themius 
thermophilus is determined by the above method to be about 94 kDa. The molecular 

35 wdghtof this same DNA polymerase as determined by the predicted amino acid 

sequence is calculated to be approximately 94,016 daltons. The purification protocol of 
native Tth DNA polymwase is described in detail in Example 1. Purification of the 
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recombinant Tth polymerase of the invention can be carried out with similar 
methodology. 

An important aspea of the present invention is the production of recombinant 
Tth DNA polymerase. As noted above, the gene encoding this enzyme has been cloned 

5 fiom Thermus fhermophilus genomic DNA. The complete coding sequence (-2.5 kb) 
for the Tth polynaerasc can be easily obtained in an -3.7 Idlobase (kb) JHindlll-EsiEII 
restriction fragment of plasmid pBSMtTthlO, although this -3.7 kb fragment contains 
an internal HindUI restriction enzyme recognition site. This plasmid was deposited 
with the American Type Culttire Collection (ATCC) in host cell £• £2li K12 strain 

10 DGlOl on December 21, 1989, under accession No. 68195. 

The con:q)lete coding sequence and deduced amino acid sequence of the 
thermostable Tth DNA polymerase enzyme is provided above. The entire coding 
sequence of the Ttii DNA polymerase gene is not required, however, to produce a 
biologically active gene product with DNA polymerase activity. The availability of 

15 DNA encoding tiie Tth DNA polymerase sequence provides the oppominity to modify 
the coding sequence so as to generate mutein (mutant protein) forms also having DNA 
polymerase activity. Amino(N)-terminal deletions of the protein, up to about one-third 
of die protein, are not believed to destroy polymerase activity of die remaining 
fragment, and recombinant truncated proteins, created by deleting approximately one- 

20 tenth of the coding sequence (for the amino-terminus), arc quite active in polymerase 
assays. Because certain N-teiminal shortened forais of the polymerase are active, die 
gene constructs used for expression of these polymerases can include the 
corresponding shortened forms of die codinjg sequence. 

In addition to the N-terminal deletions, individual anuno acid residues in the 

25 peptide chain comprising Tth polymerase may be modified by oxidation, reduction, or 
otfier derivation, and the protein may be cleaved to obtain fragments tiiat retain activity. 
Such alterations that do not destroy activity do not remove the protein from die 
definition of a protein witfi Ttfi polymerase activity and so are specifically included 
wiUun the scope of die present invention. Modifications to die primary structure of die 

30 Tdi gene DNA polymerase by deletion, addition, or alteration so as to change die amino 
acids incorporated into die Tdi DNA polymerase during translation can be made vdUiout 
destroying the high temperature DNA polymerase activity of the protein. Such 
substimtions or other alternations result in die production of proteins having an amino 
add sequence encoded by DNA falling widiin the contemplated scope of the present 

35 invention. Likewise, the cloned genomic sequence, or homologous syndietic 
sequences, of die Tdi DNA polymerase gene can be used to express a fusion 
polypeptide with Tdi DNA polymerase activity or to express a protein widi an amino 
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add sequence identical to that of native Tth DNA polymerase. In addition, such 
expression can be directed by the Tth DNA polymerase gene control sequences or by a 
control sequence that functions in whatever host is chosen to express the Tth DNA 
polymerase. 

5 Thus, the present invention provides the complete coding sequence for Ttii 

DNA polymerase fiom which expression vectors applicable to a variety of host systems 
can be constructed and the coding sequence expressed. Portions of the Tth polymerase- 
encoding sequence are also useful as probes to retrieve other thermostable polymcrase- 
encoding sequences in a variety of species. Accordingly, portions of the genomic DNA 

10 encoding at least four to six amino acids can be replicated in £. cqU and die denamred 
forms used as probes or oligodeoxyribonucleotide probes that encode at least four to 
sbc amino acids can be synthesized and used to retrieve additional DNAs encoding a 
thermostable polymerase. Because there may not be an exact match between the 
nucleotide sequence of the thermostable DNA polymerase gene of Thermus 

15 therniophilus and the corresponding gene of other species, oligomers containing 
approximately 12-18 nucleotides (encoding the four to six amino acid sequence) are 
usually necessary to obtain hybridization under conditions of sufficient stringency to 
eliminate false positives. Sequences encoding six amino acids supply ample 
information for such probes. 

20 The present invention, by providing the coding and amino acid sequences for 

Tth DNA polymerase, therefore enables the isolation of other thermostable polymerase 
enzymes and the coding sequences for tiiose enzymes. The Taq and Tth DNA 
polymerase coding sequences are very similar, and this similarity facilitated tiie 
identification and isolation of the Tth DNA polymerase coding sequence. The regions 

25 of dissimilarity between the Taq and Tth DNA polymerase coding sequences can also 
be used as probes, however, to identify other themiostable polymerase coding 
sequences that encode enzymes quite divergent from, for example, Taq polymerase but 
similar to Ttii polymerase. ' 

Several such regions of dissimilarity between the Taq and Tth DNA polymerase 

30 coding sequences exist. These regions include the sequences for codons 225-230; 238- 
246; 241-249; 335-343; 336-344; 337-345; 338-346; and 339-347. For regions nine 
codons in length, probes corresponding to these regions can be used to identify and 
isolate themiostable polymerase encoding DNA sequences that are identical (and 
complementary) to the probe for a contiguous sequence of at least five codons. For the 

35 region six codons in length, a probe corresponding to this region can be used to 

identify and isolate thermostable polymerase-encoding DNA sequences that are identical 
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to the probe for a contiguous sequence of at least four codons. Such thermostable 
polymerase-cncoding DNA sequences need not be from a 
Themius themK>philus species, or even from the genus Thermus, to be isolated, so 
, long as the requisite homology is present 

Whether one desires to produce an enzyme identical to native Tth DNA 
polymerase or a derivative or homologue of that enzyme, the production of a 
recombinant form of Tth polymerase typically involves the construction of an 
expression vector, tiie transformation of a host cell with the vector, and culture of the 
transformed host cell under conditions such that expression will occur. To construct 
the expression vector, a DNA is obtained that encodes the mature (used here to includ 
all muteins) enzyme or a fusion of the Tth polymerase to an additional sequence that 
does not destroy activity or to an additional sequence cleavable under controlled 
conditions (such as treatment witii peptidase) to give an active protein. The coding 
sequence is tficn placed in operable linkage with suitable control sequences in an 
expression vector. The vector can be designed to replicate autonomously in die host 
cell or to integrate into the chromosomal DNA of the host cell. The vector is used to 
transform a suitable host, and the transformed host is cultured under conditions suitab 
for expression of recombinant Tth polymerase. The Tth polymerase is isolated from 
the medium or from tiie cells; recovery and purification of the protein may not be 
necessary in some instances, where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of ways. For example the 
desired coding sequence may be obtained from genomic fragments and used direcUy i 
appropriate hosts. The construction for expression vectors operable in a variety of 
hosts are made using appropriate replicons and control sequences, as set forth genei-all 
below. Construction of suitable vectors containing the desired coding and control 
sequences employs standard ligation and restriction techniques that arc well underst 
in the art. Isolated plasmids, DNA sequences, or synthesized oligonucleotides are 
cleaved, modified, and rcligated in the forai desired. Suitable restriction sites can, if 
not normally available, be added to the ends of tiie coding sequence so as to facilitate 
construction of an c)q)ression vector, as exemplified below. 

Site-specific DNA cleavage is performed by trcatiiig with the suitable restrictio 
enzyme (or enzymes) under conditions tiiat arc generally understood in the art and 
specified by the manufacturers of commercially available restriction enzymes. See, 
e.g.. New England Biolabs, Product Catalog. In general, about 1 \ig of plasmid or 
other DNA is cleaved by one unit of enzyme in about 20 nl of buffer solution; in the 
examples below, an excess of restriction enjgrme is generally used to ensure complete 
digestion of tiie DNA. Incubation times of about one to two hours at about 37*C are 
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typical, although variations can be tolerated. After each incubation, protein is removed 
by extraction with phenol and chloroform; this extraction can be followed by ether 
extraction and recovery of the DNA finom aqueous Mictions by precipitation with 
ethanol. If desired, size separation of the cleaved fragments may be perforaied by 
5 polyaciylamide gel or agarose gel electrophoresis using standard techniques. See. e.g., 
Mslb&dS in Enzymologv. 1980, ^:499-560. 

Resttiction^leaved fragments with single-strand "overhanging" termini can be 
made blunt-ended (double-strand ends) by treating with the large fragment of £. cell 
DNA polymerase I (Klenow) in the presence of the four deoxynucleoside triphosphates 

10 (dNTPs) using incubation times of about 15 to 25 minutes at 20 to 25'C in 50 mM Tris 
pH 7.6, 50 mM NaQ. 10 mM MgCb, 10 mM DTT and 5 to 10 ^iM dNTPs. The 
Klenow fragment fills in at 5' protruding ends, but chews back protruding 3' single 
strands, even though the four dNTPs are present. If desired, selective repair can be 
performed by supplying only one of the, or selected, dNTPs within the limitations 

15 dictated by the nature of tiie protruding ends. After treatment with Klenow, the mixture 
is extracted with phenol/chloroforai and eihanol precipitated. Similar results can be 
achieved using SI nuclease,. because treatment under appropriate conditions with S 1 
nuclease results in hydrolysis of any single-stranded portion of a nucleic acid. 
Syntiietic oligonucleotides can be prepared using the triester method of 

20 Matteucci £t al., 198 1 , 1 Am- Olsm. SfiC. iii2:3 1 85-3 191 or automated syntiiesis 
methods. Kinasing of single strands prior to annealing or for labeling is achieved using 
an excess, e.g., cpproximately 10 units, of polynucleotide kinase to 0.5 nM substrate 
in the presence of 50 mM Tris, pH 7.6, 10 mM MgCla, 5 mM diUiiotiireitol (DTT), and 
1 to 2 nM ATP. If kinasing is for labeling of probe, the ATP will contain high specific 

25 activity Y-32P. 

Ligations are performed in 15-30 ^il volumes under tiie following standard 
conditions and temperatures: 20 mM Tris-Cl, pH 7.5, 10 mM MgCh, 10 mM DTT, 33 
Hg/ml BSA, 10 mM-50 mM NaQ, and eitiier 40 ^M ATP and 0.01-0.02 (Weiss) units 
T4 DNA ligase at O'C (for ligation of fragments with complementary single-stranded 

30 ends) or 1 mM ATP and 0.3-0.6 units T4 DNA ligase at 14'C (for "blunt end" 

ligation), totermolecular ligations of fraginents with complementary ends arc usually 
performed at 33-100 jig/ml total DNA concentrations (5-100 nM total ends 
concentration). Intermolecular blunt end ligations (usually employing a 10-30 fold 
molar excess of linkers) are perforatied at 1 jiM total ends concentration. 

35 In veaor construction, the vector fragment is conunonly treated with bacterial or 

calf intestinal alkaline phosphatase (BAP or CIAP) to remove the 5' phosphate and 
prevent religation and reconstruction of the vector. BAP and CHAP digestion 
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additions are well known in the art, and published protocols usually accompany the 
commercially available BAP and CIAP enzymes. To recover the nucleic acid 
fragments, the preparation is extracted with phenol-chloroform and ethanol precipitated 
to remove AP and purify the DNA. Alternatively, religation can be prevented by 
5 restriction enzyme digestion of unwanted vector fragments before or after ligation of the 
desired vector. 

For portions of vectors or coding sequences that require sequence 
modifications, a variety site-specific primer-directed mutagenesis methods are available. 
The polymerase chain reaction (PCR) can be used to perform site-specific mutagenesis. 
10 In another technique now standard in the art, a synthetic oligonucleotide encoding the 
desired mutation is used as a primer to direct synthesis of a complementary nucleic acid 
sequence of a single-stranded vector, such pBS13+, that serves as a template for 
construction of the extension product of the mutagenizing primer. The mutagenized 
DNA is transformed into a host bacterium, and culmres of the transformed bacteria are 
15 plated and identified. The identification of modified vectors may involve transfer of the 
DNA of selected transformants to a nitrocellulose filter or otiier membrane and the 
"lifts" hybridized with kinased synthetic primer at a temperature that permits 
hybridization of an exact match to the modified sequence but prevents hybridization 
witii the original strand. Transfoimants tiiat contain DNA tiiat hybridizes witii tiie 
20 probe are then cultured and serve as a reservoir of the modified DNA. 

In the constructions set forth below, correct ligations for plasmid construction 
are confirmed by first transforming £. soli strain DGlOl or another suitable host, with 
the ligation mixture. Successful transformants are selected by ampicillin, tetracycline or 
other antibiotic resistance or sensitivity or by using otiier markers, depending on tiie 
25 mode of plasmid construction, as is understood in the art. Plasmids from the 

transformants arc then prepared according to tiie method of Clewell si al- 1969, ErQ£. 
Nad . Acad . Ssi- USA 62:1159. optionally following chloramphenicol amplification 
(Clewell, 1972, 1. Bacteriol . 1111:667). Anotiier metiiod for obtaining plasmid DNA is 
described as tiie "Base-Acid" extraction metiiod at page 11 of tiie Betiiesda Research 
3Q Laboratories publication Focus , volume 5, number 2, and very pure plasmid DNA can 
be obtained by replacing steps 12 tiuough 17 of tiie protocol witii CsCl/etiudium 
bromide ultracentrifiigation of tiie DNA. The isolated DNA is analyzed by restriction 
enzyme digestion and/or sequenced by tiie dideoxy metiiod of Sanger si M-. 1977, 
Proc. Nati . Acad . Sfl. USA 24:5463, as furtiier described by Messing si al-, 198 1 , 
35 Uufi. Acids Res . 2:309, or by tiie metiiod of Maxam si al-. 1980, MgthodS in 
pnzymology 65:499. 
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The control sequences, expression vectors, and iransformation methods are 
dependent on the type of host cell used to express the gene. Generally, pnxiaiyotic, 
yeast, insect, or mamm al i an cells are used as hosts. Procaryotic hosts arc in general the 
most efficient and convenient for the production of recombinant proteins and are 
5 therefore preferred for the expression of Tth polymerase. 

The procaiyote most fircquently used to express recombinant proteins is £. soJ^. 
For cloning and sequencing, and for expression of constructions under control of most 
bacterial promoters, E cqU K12 strain MM294, obtained from the £. sqU Genetic 
Stock Center under GCSC #6135, can be used as the host For expression vectors 
10 with the PlNrbs control sequence, £• coli K12 strain MClOOO lambda lysogen. 
N7N53CI857 SusPgo, ATCC 3953 1 , may be used. £. cdi DG 116. which was 
deposited with tiie ATCC (ATCC 53606) on April 7, 1987, and £. cqU KB2, which 
was deposited with the ATCC (ATCC 53075) on March 29, 1985, are also useful host 
cells. For M13 phage recombinants, £. soh. strains susceptible to phage infection, such 
15 as E. £Qli K12 strain DG98, are employed. The DG98 strain was deposited with the 
ATCC (ATCC 39768) on July 13. 1984. 

However, microbial strains other than £. sqU can also be used, such as bacilli, 
for example Bacillus suhtili.s. various species of Pseudomonas, and other bacterial 
strains, for recombinant expression of Tth DNA polymerase. !n such p^ocaI>'or^ : 
20 systems, plasmid vectors that contain replication sites and control sequences derived 
from the host or a species compatible with the host are typically used. 

For example, ssM. is typically transforaied using derivatives of pBR322, 
described by Bolivar si al., 1977, Gene 2:95. Plasmid pBR322 contains genes for 
ampicillin and tetracycline resistance. These drug resistance markers can be either 
25 retained or destroyed in constructing die desired vector and so help to detect the 
presence of a desired recombinant Commonly used procaryotic control sequences, 
i.e., a prompter for transcription initiation, optionally with an operator, along with a 
ribosome binding site sequence, include the p-lactamase (penicillinase) and lactose (lac) 
prpinoter systems (Chang £i al., 1977, IMSSS. 1056), the tryptophan (trp) 
30 promoter system (Goeddel £i al., 1980. Muc Acids EsS- S:4057). and the lambda- 
derived Pl promoter (Shimatakefiial., 1981. Nature 292 :128^ and N-gene ribosome 
binding site (Nrbs)- A portable control system cassette is set forth in U.S. Patent No. 
4,711,845, issued December 8, 1987. This cassette comprises a Pl promoter operably 
linked to the Nrbs in turn positioned upstream of a third DNA sequence having at least 
35 one restriction site that permits cleavage within sue bp 3' of the Nrbs sequence. Also 
useful is the phosphatase A (phoA) system described by Chang £t al- in European 
Patent Publication No. 196,864, published October 8, 1986. However, any available 
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promoter system compatible with procaryotcs can be used to constra 
vector of the invention. 

In addition to bacteria, eucaryotic microbes, such as yeast, can also be used as 
recombinant host cells. Laboratory strains of Saccharomvces cerevisiae . Baker's yeast, 

5 arc nwst often iised, although a number of other strains are conmonlyavailab^ While 
vectors employing the two micron origin of replication are common (Broach, 1983. 
Mcth. Enz> iQl:307), other plasmid vectors suitable for yeast expression are known 
(see, for example, Stinchcomb fit aIm 1979. NaM£ 222:39; Tschempe si al., 1980, 
Gene 10:157: and Clarke fit al- 1983, Meth . Enz; 1111:300). Ci)ntrol sequences for 

10 yeast vectors include pronwters for the syntiiesis of glycolytic enzymes (Hess £i al-, 
1968, 1 Adv : Enzyme Reg . 2:149, and Holland fit fll., 1978, Biotechnologv 17:4900). 
Additional promoters known in die art include the promoter for 3-phosphoglycerate 
kinase (Hitzeman £l al-, 1980, L BioL Chem . 255:2073) and tiiose for otiicr glycolytic 
enzymes, such as glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate 

15 decarboxylase, phosphofiructokinase, glucose-6-phosphate isomcrase, 3- 
phosphoglycerate mutase, pyruvate kinase, trioscphosphate isomerase, 
phosphoglucosc isomerase, and glucokinase. Other promoters that have the additional 
advantage of transcription controlled by growth conditions are the promoter regions for 
alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes 

20 associated with nitrogen metabolism, and enzymes responsible for maltose and 
galactose utilization (Holland, SHjaa). 

Teraiinator sequences may also be used to enhance expression when placed at 
the 3* end of the coding sequence. Such terminators are found in the 3' untranslated 
region following the coding sequences in yeast-derived genes. Many vectors contain 

25 control sequences derived from the enolase gene contained in plasmid peno46 (Holland 
filfll., 1981, 1 Biol. Chfim. 256:1385).or die LEU2 gene obtained from YEpl3 
(Broach fii al- 1978, Gene fi:121); however, any vector containing a yeast-compatible 
promoter, origin of replication, and other control sequences is suitable for use in 
constructing yeast Ttii expression vectors. 

30 The Ttii gene can also be expressed in eucaryotic host cell cultures derived from 

multicellular organisms. See, for example, Tissue Culture ^ Academic Press, Ouz and 
Patterson, editors (1973). Usefiil host ceU lines include COS-7, C0S-A2, CV-1 , 
murine cells such as murine myelomas N51 and VERO, HeLa cells, and Chinese 
hamster ovary (CHO) cells. Expression vectors for such cells ordinarily include 

35 promoters and control sequences compatible with mammalian cells such as, for 

example, the commonly used early and late promoters from Simian Virus 40 (SV 40) 
(Fiers fil M.. 1978. Nature 273 :113V or other viral promoters such as tiiose derived 
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from polyoma, adenovirus 2, bovine papilloma vims (BPV), or avian sarcoma viruses, 
or immunoglobulin promoters and heat shock promoters. A system for expressing 
DNA in mamm a li a n systems using a BPV vector system is disclosed in U.S. Patent 
No. 4,419,446. A modificadon of this system is described in U.S. Patent No. 
5 4,601,978. GenCTal aspects of mamimalian cell host system transformations have been 
described by Axel, U.S. Patent No. 4,399,216. "Enhancer" regions are also important 
in optimizing expression; these are, generally, sequences found upstream of the 
promoter region. Origins of replication may be obtained, if needed, from viral sources. 
However, integration into the chromosome is a common mechanism for DNA 

10 replication in eucaiyotes. 

Plant cells can also be used as hosts, and control sequences compatible with 
plant cells, such as the nopaline syntfiase promoter and polyadenylation signal 
sequences (Depicker si al.» 1982, 1 iM. iS^. fifin- 1:561) are available. Expression 
systems employing insect cells utilizing the control systems provided by baculovirus 

15 vectors have also been described (MiUer £i al., in Generic Enpineering (1986) Setlow £i 
al., eds., Plenum Publishing, Vol. 8, pp. 277-297). Insect cell-based expression can 
be accomplished in Spodoptera fm gipeida . These systems are also successful in 
prcxlucing recombinant Tth polymerase. 

Depending on the host cell used, o^sformarion is done using standard 

20 techniques appropriate to such cells. The calcium treatment employing calcium 

chloride, as described by Cohen, 1972, Eeq£. IM- Acad . Sci. USA ^:21 10 is used 
for procaryotes or otiier cells that contain substantial cell wall barriers. Infection with 
Aerobacterium mmefaciens (Shaw fit al., 1983, Gene 22:315) is used for certain plant 
cells. For mammalian cells, the calcium phosphate precipitation metiiod of Graham and 

25 van der Eb, 1978, Virology 52:546 is preferred. Transformations into yeast are carried 
out accoidihg to the method of Van Solingen fli., 1977, 1 BacL 12Q:946 and Hsiao 

Si al., 1979, Eeqs. Acad. Sci. USA 26:3829. 

Once the Tth DNA polymerase has been expressed in a recombinant host cell, 
purification of the protein may be desired. Although the purification procedures 

30 previously described can be used to purify the recombinant tiiermostable polymerase of 
the invention, hydrophobic interaction chromatography purification methods are 
preferred. Hydrophobic interaction chromatography is a separation technique in which 
substances are separated on tiie basis of differing strengths of hydrophobic interaction 
with an uncharged bed material containing hydrophobic groups. Typically, the column 

35 is first equilibrated under conditions favorable to hydrophobic binding, e.g., high ionic 
strength. A descending salt gradient may be used to elute the sample. 
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According to the invention, the aqueous mixture (containing either native or 
recombinant Tth DNA polymerase) is loaded onto a column containing a relatively 
strong hydrophobic gel such as phenyl sepharose (manufactured by Pharaxacia) cr 
Phenyl TSK (manufactured by Toyo Soda). To promote hydrophobic interaction with 

5 a phenyl sepharose column, a solvent is used which contain:, for example, greater than 
or equal to 0.2 M ammonium sulfate, with 0.2 M being pin?^er fhe column and the 
sample are adjusted to 0.2 M anmionium sulfate in 50 mM Tris, pH 7.5, and 1 mM 
EDTA CTE") buffer ttiat also contains 1 mM DTT and the sample applied to the 
column. The column is washed with the 02 M f inmonium sulfate buffer. The enzyme 

10 tnay then be eluted with solvents which attenuate hydrophobic interactions such as, for 
example, decreasing salt gradients, ethylene or propylene glycol, or urea. For 
recombinant Tth polymerase, a preferred embodiment involves washing the column 
sequentially witii the Tris-EDTA buffer and tiie Tris-EDTA buffer containing 20% 
ethylene glycol. The Tth polymerase is subsequentiy eluted from the column with a 0 

15 to 4 M urea gradient in the Tris-EDTA ethylene glycol buffer. 

For long-term stability, Ttii DNA polymerase enzyme is stored in a buffer that 
contains one or more non-ionic polymeric detergents. Such detergents are generally 
those that have a molecular weight in the range of approximately 100 to 250,000 
preferably about 4,000 to 200,000 daltons and stabilize die enzyme at a pH of from 

90 about 3.5 to about 9.5, preferably from about 4 to 8.5. Examples of such detergents 
include tiiose specified on pages 295-298 of McCutcheon*s Emulsifiers & Detergents, 
North American edition (1983), published by the McCutcheon Division of MC 
Publishing Co., 175 Rock Road, Glen Rock, NJ (USA), die entire disclosure of which 
is incoiporated herein by reference. Rfeferably, the detergents are selected from the 

25 group comprising ethoxylated fatty alcohol ethers and lauryl etiiers, etiioxylatcd alkyl 
phenols, octylphenoxy polyethoxy ethanol compounds, modified oxyethylated and/or 
oxypropylated straight-chain alcohols, polyethylene glycol monooleate compounds, 
polysorbate compounds, and phenolic fatty alcohol ethers. More particularly preferred 
are Tween 20, a polyoxyethylatcd (20) soriDitan monolaurate from ICI Americas Inc., 

30 Wilmington, D.E., and Iconol™ NP-40, an ethoxylated alkyl phenol (nonyl) from 
BASF Wyandotte Corp. Parsippany, NJ, 

The thermostable enzyme of this invention may be used for any purpose in 
which such enzyme activity is necessary or desired. In a particularly preferred 
embodiment, the enzyme catalyzes the nucleic acid amplification reaction known as 

35 PGR. This process for amplifying nucleic acid sequences is disclosed and claimed in 
U.S. Patent No. 4,683,202^ issued July 28, 1987, die disclosure of which is 
incorporated herein by reference. The PGR nucleic acid amplification method involves 
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amplifying at least one specific nucleic acid sequence contained in a nucleic acid or a 
mixture of nucleic acids and produces double-stranded DNA. 

For ease of discussion, the protocol set forth below assumes that the specific 
sequence to be amplified is contained in a double-stranded nucleic acid. However, the 

5 process is equally useful in amplifying single-stranded nucleic acid, such as mRNA, 
although in the preferred embodiment the ultimate product is still double-stranded 
DNA. In the amplification of a angle-s&anded nucleic acid, the first step involves the 
synthesis of a complementary strand (one of the two amplification primers can be used 
for this purpose), and the succeeding steps proceed as in the double-stranded 
10 amplification process described below. • 

This amplification process comprises the steps of: 

(a) contacting each nucleic acid strand with four different nucleoside 
triphosphates and one oligonucleotide primer for each strand of tiie specific sequence 
being amplified, wherein each primer is selected to be substantially complementary to 
15 the different strands of the specific sequence, such that the extension product 

synthesized torn one primer, when it is separated from its complement, can serve as a 
template for synthesis of the extension product of the other primer, said contacting 
being at a temperature which allows hybridization of each primer to a complementary 
nucleic acid so^d; 

20 (b) contacting each nucleic acid strand; at the same time as or after step (a), 

with a DNA polymerase from Thcrmys thermophilus which enables combination of the 
nucleoside triphosphates to form primer extension products complementary to each 
strand of the ispecific nucleic acid sequence; 

(c) maintaining the mixnire from step (b) at an effective temperature for an 
25 effective time to promote the activity of die enzyme and to synthesize, for each different 

sequence being amplified, an extension product of each primer which is complementary 
to each nucleic acid strand template, but not so high as to separate each extension 
product from the complementary strand template; 

(d) heating die mbcturefirom step (c) for an effective time and at an effective 
30 temperature to separate the primer extension products from tfie templates on which they 

were syntiiesized to produce single-stranded molecules but not so high as to denanire 
irreversibly the enzyme; 

(e) cooling die nuxnire from step (d) for an effective time and to an effective 
temperature to promote hybridization of a primer to each of die single-stranded 

; 35 molecules produced in step (d); and 

(f) maintaining die mbenirc from step (e) at an effective temperature for an 
effective rime to promote the activity of die enzyme and to syntiiesize, for each different 
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sequence being amplified, an extension product of each primer which is complementary 
to each nucleic acid strand template produced in step (d) but not so high as to separate 
each extension product from the complementary strand ten:5)late. The effective times 
and temperatures in steps (e) and (f) may coincide, so that steps (e) and (f) can be 
5 carried out simultaneously. Steps (d)-(O are repeated until the desired level of 
amplification is obtained 

The amplification method is useful not only for producing large amounts of a 
specific nucleic acid sequence of known sequence but also for producing nucleic acid 
sequences which are known to exist but arc not completely specified. One need knov/ 

10 only a sufficient number of bases at both ends of the sequence in sufficient detail so that 
two oligonucleotide primers can be prepared which will hybridize to different strands of 
the desired sequence at relative positions along the sequence such that an extension 
product synthesized from one primer, when separated from the template (complement), 
can serve as a template for extension of the other primer into a nucleic acid sequence of 

15 defined length. The greater the knowledge about the bases at both ends of the 

sequence, the greater can be the specificity of the primers for the target nucleic acid 
sequence and the efficiency of the process. In any case, an initial copy of the sequence 
to be amplified must be available, although the sequence need not be pure or a discrete 
molecule. In general, the amplification process involves a chain reaction for producing, 

2Q in exponential quantities relative to the number of reaction steps Lnvolved, at least one 
specific nucleic acid sequence given that (a) the ends of tiie required sequence are 
known in sufficient detail that oligonucleotides can be syndiesized which will hybridize 
to them, and (b) that a small anwunt of the sequence is available to initiate the chain 
reaction. The product of the chain reaction will be a discrete nucleic acid duplex with 

25 termini corresponding to the ends of the specific primers employed. 

Any nucleic acid sequence, in purified or nonpurified form, can be utilized as 
the starting nucleic acid(s), provided it contains or is suspected to contain the specific 
nucleic acid sequence desired. The nucleic acid to be amplified can be obtained from 
any source, for example, from plasmids such as pBR322, from cloned DNA or RNA, 

3Q or from natural DNA or RNA from any source, including bacteria, yeast, viruses, 
organelles, and higher organisms such as plants and animals, DNA or RNA may be 
extracted from blood, tissue material such as chorionic villi, or amniotic cells by a 
variety of techniques. See, e.g., Maniatis £t al., supra , pp. 280-28 1 , Thus, the 
process may employ, for example, DNA or RNA, including messenger RNA, which 

35 DNA or RNA may be single-stranded or double-stranded. In addition, a DNA-RNA 
hybrid which contains one strand of each may be utilized, A mixture of any of these 
nucleic acids can also be employed as can nucleic acids produced from a previous 
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amplification reaction (using the same or dififerent primers). The specific nucleic acid 
sequence to be amplified may be only a fraction of a laigc molecule or can be present 
initially as a discrete molecule, so that the qjccific sequence constitutes the entire 
nucldc acid. 

5 The sequence to be amplified need not be present initially in a pure forai; the 

sequence can be a minor fraction of a complex mbcture, such as a portion of die p- 
globin gene contained in whole human DNA (as exemplified in Saiki 1985, 
SflSDCfi 220:1530-1534) or a portion of a nucleic acid sequence due to a particular 
microorganism, which organism might constitute only a very minor fiction of a 
10 particular biological sample. The cells can be direcdy used in die amplification process 
after suspension in hypotonic buffer and heat treatment at about 90-100'C until cell 
lysis and dispersion of intracellular components occur (generally 1 to 1 5 minutes). 
After die heating step, die amplification reagents may be added direcdy to die lysed 
cells. The starting nucleic acid sequence may contain more dian one desired specific 
15 nucleic acid sequence. The amplification process is useful not only for producing large 
amounts of one specific nucleic acid sequence but also for amplifying simultaneously 
more than one different specific nucleic acid sequence located on the same or different 
nucleic acid molecules. 

Primers play a key role in die PCR process. The word "primer" as used in 
20 describing die amplification process can refer to more dian one primer, particulaiy in 
die case where diere is some ambiguity in die information regarxiing die terminal 
sequence(s) of die fragment to be amplified. For instance, in die case where a nucleic 
acid sequence is inferred from protein sequence information, a collection of primers 
containing sequences representing all iwssible codon variations based on degeneracy of 
25 die genetic code will be used for each strand. One primer fiom diis collection will be 
sufficiendy homologous widi die end of die desired sequence to be amplified to be 
usefid for amplification. 

In addition, more dian one specific nucleic acid sequence can be amplified fiom 
die first nucleic acid or mixnire of nucleic acids, so long as die appropriate number of " 
30 different oligonucleotide primers are utilized. For example, if two different specific 
nucleic acid sequences are to be produced, four primers are utilized. Two of die 
primers are specific for one of die specific nucleic acid sequences and die odier two 
primers are specific for die second specific nucleic acid sequence. In diis manner, each 
of die two different specific sequences can be produced exponentially by die present 
35 process. 

A sequence widiin a given sequence can be amplified after a given number of 
cycles to obtain greater specificity of die reaction by adding after at least one cycle of 
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amplification a set of primers that are complenientaiy to internal sequences (that are not 
at the ends) of the sequence to be amplified Such primers may be added at any stage 
and will provide a shorter amplified fragment Alternatively, a longer firagment can be 
prepared by using primers with non-complementary 5*-ends but having some overlap 

5 with the primers previously utilized in the anq)lification. 

Primers also play a key role when the amplification process is used for in vitro 
mutagenesis. The product of an amplification reaction where the primers employed are 
not exactly ccmpiementary to the original template will contain the sequence of the 
primer rather than the template, so introducing an in vitro mutation. In further cycles 

10 this mutation will be amplified with an undiminished efficiency because no further 
mispaiied priming is required. The process of making an altered DNA sequence as 
described above could be repeated on the altered DNA using different primers to induce 
further sequence changes. In this way, a series of mutated sequences can gradually be 
produced wherein each new addition to the series differs from the last in a minor way, 

15 but from the original DNA source sequence in an increasingly major way. 

Because the primer can contain as part of its sequence a non-complementary 
sequence, provided that a sufficient amount of the primer contains a sequence that is 
complementary to the strand to be amplified, many other advantages can be realized. 
For example, a nucleotide sequence that is not complementary to the template sequence 

20 (such as, e.g., a promoter, linker, coding sequence, etc.) may be attached at the 5* end 
of one or both of the primers and so appended to the product of the amplification 
process. After the extension primer is added, sufficient cycles are run to achieve the 
desired amount of new template containing the non-complementaiy nucleotide insen. 
This allows production of large quantities of the combined fragments in a relatively 

25 short period of time (e.g., two hours or less) using a simple technique. 

Oligonucleotide primers can be prepared using any suitable method, such as, 
for example, the phosphotriester and phosphodiester methods described above, or 
automated embodiments thereof. In one such automated embodiment, 
diethylphosphoramidites are used as starting materials and may be synthesized as 

30 described by Beaucage et aL. 198L Tetrahedron Letters 22:1859-1862. One method 
for syntiiesizing oligonucleotides on a nxxiified solid support is described in U.S. 
Patent No. 4,458,066. One can also use a primer that has been isolated from a 
biological source (such as a restriction endonuclease dijgcst). 

No matter what primers are used, however, the reaction mixture must contain a 

35 template for PGR to occur, because the specific nucleic acid sequence is produced by 
using a nucleic acid containing that sequence as a template. The first step involves 
contacting each nucleic acid strand with four different nucleoside triphosphates and one 
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oligonucleotide primer for each strand of each specific nucleic acid sequence being 
anq)lified or detected. If the nucleic acids to be amplified or detected are DNA, then the 
nucleoside triphosphates are usuaUy dATP. dCTP, dGTP, and TTP. aldiough various 
nucleotide derivatives can also be used in the process. Hie concentration of nucleotide 
5 triphosphates can vary widely. Typically the concentration is 50-200 \iM in each dNTP 
in die buffer for amplification, and MgCh is present in die buffer in an amount of 1 to 
3 mM to increase the efficiency and specificity of die reaction. However, dNTP 
concentrations of 1-20 \jM may be preferred for some applications, such as DNA 
sequencing. 

10 The nucleic acids strands of the target nucleic acid serve as teihplates for the 

synthesis of additional nucleic acids strands, which are extension products of die 
primers. This syndiesis can be perforaied using any suitable method, but generally 
occurs in a buffered aqueous solution, preferably at a pH'of 7-9, most preferably about 
8. To facilitate synthesis, a molar excess (for cloned nucleic acid, usually about 1000: 1 

15 primentemplate and for genomic nucleic acid, usually about 108; 1 primentemplate) of 
the two oligonucleotide primers is added to die buffer containing Lhe template strands. 
As a practical matter, the amount of primer added will generally be in molar excess over 
the amount of complementary so^d (template) when the sequence to be amplified is 
contained in a mixture of complicated long-chain nucleic acid strands. A large molar 

The mbcnire of template, primers, and nucleoside triphosphates is then treated 
accOTding to whedier die nucleic acids being amplified or detected are double- or single- 
stranded. If die nucleic acids are single-stranded, tfien no denaturation step need be 

employed, and the reaction mixture is held at a temperanire which promotes 
25 hybridization of die primer to its complementary target (template) sequence. Such 
temperature is generally finom about 35'C to 65*C or more, preferably about 37-60*C 
for an effective time, generally fiom a few seconds to five minutes, preferably from 30 
seconds to one minute. A hybridization temperanuc of 45-58'C is used for Tdi DNA 
polymerase, and 15-mer or longer primers are used to increase die specificity of primer 
. 30 hyteidization. Shoner primers require lower hybridization temperatures. The 

complement to die original single-stranded nucleic acids can be syndiesized by adding 
Tth DNA polymerase in the presence of the appropriate buffer, dNTPs, and one or 
more oligonucleotide primers. If an appropriate single piimer is added, die primer 
extension product will be complementary to die single-stranded nucleic acid and will be 
35 hyteidized with die nucleic acid strand in a duplex of strands of equal or unequal length 
(depending on where die piimer hybridizes to die template), which may dien be 
separated into single strands as described above to produce two single, separated. 
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complementary strands. Alternatively, two or more appropriate primers (one of which 
will prime synthesis using the extension product of the other primer as a template) may 
be added to the single-stranded nucleic acid and the reaction carried out 

If the nucleic acid contains two strands, as in the case of amplification of a 
5 double-stranded target or second-cycle amplification of a single-stranded target, the 
strands of nucleic acid must be separated before the primers are hybridized. This strand 
separation can be accomplished by any suitable denaturing method, including physical, 
chemical or enzymatic means. One preferred physical method of separating the strands 
of the nucleic acid involves healing the nucleic acid until complete (>99%) denaturation 

IQ occurs. Typical heat denaturation involves temperaranes ranging from about 90 to 
105*C for times generally ranging from about a few seconds to 5 minutes, depending 
on the composition and size of the nucleic acid. Preferably, the effective denaturing 
temperature is 9O-100*C for 10 seconds to 3 minutes. Strand separation may also be 
induced by an enzyme from the class of enzymes known as helicases or the enzyme 

15 RecA, which has helicase activity and in the presence of riboATP is known to denature 
DNA. The reaction conditions suitable for separating the strands of nucleic acids with 
helicases are described by Kuhn Hoffmann-Berling, 1978, CSH-Ouantitativc Biology 
42:63, and techniques for using RecA are reviewed in Radding, 1982, Ann . Rev . 
Genetics 1^:405-437. The denaturation produces two separated complementary strands 

— <^ WA, V^VMAJ. VTA WAlV^UiU AWll^Ul. 

If the double-stranded nucleic acid is denatured by heat, the reaction mixture is 
allowed to cool to a temperatuie which promotes hybridization of each primer to the 
complementary target (template) sequence. This temperature is usually from about 
35*C to 65'C or more, depending on reagents, preferably 37-60*C. The hybridization 
25 temperature is maintained for an effective time, generally 30 seconds to 5 minutes, and 
preferably 1-3 minutes. In practical terms, the temperature is simply lowered from 
about 95*C to as low as 37*C, and hybridization occurs at a temperature within this 
range. 

Whether the nucleic acid is single- or double-stranded, the DNA polymerase 
30 from Thermus themiophilus may be added at the denaturation step or when the 
temperature is being reduced to or is in the range for pronwting hybridization. 
Although the themaostability of Tth polymerase allows pne to add Tth polymerase to the 
reaction mixture at any time, one can substantially inhibit non-specific amplification by 
adding the polynicrase to the reaction mixture at a point in time when the mixture will 
35 not be cooled below tiie stringent hybridization temperature. After hybridization, the 
reaction mbcturc is ilien heated to or maintained at a temperature at which the activity of 
the enzyme is promoted or optimized, i.e., a temperature sufficient to increase the 
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activity of the cn2yme in facilitating syntiiesis of the primer extension products fix)m the 
hybridized primer and template. The temperature must actually be sufficient to 
synthesize an extension product of each primer which is complementary to each nucleic 
acid template, but must not be so high as to denature each extension product from its 
5 complementary template (ie., the temperature is generally less than about 80-90'C). 

Depending on the nucleic acid(s) employed, the typical temperature effective for 
this synthesis reaction generally ranges from about 40 to 80*0, preferably 50-75*C 
The temperature more preferably ranges finom about 65-75*C for Thcrmus 
thgimophilius DNA polymerase. The period of time required for tiiis synthesis may 

10 range from about 0.5 to 40 minutes or more, depending mainly on the temperature, the 
length of the nucleic acid, the enzyme, and the complexity of the nucleic acid mixture. 
The extension time is usually about 30 seconds to three minutes. If the nucleic acid is 
longer, a longer time period is generally required for complementary strand synthesis. 
The newly synthesized strand and the complement nucleic acid strand form a 

15 double-stranded molecule which is used in the succeeding steps of the amplification 
process. In the next step, tiie strands of die double-stranded molecule are separated by 
heat denaturation at a temperature and for a time effective to denature the molecule, but 
not at a temperature and for a period so long that the thermostable enzyme is completely 
and irreversibly denatured or inactivated. After tiiis denaturation of template, the 

2Q temperature is decreased to a level which promotes hybridization of the primer to the 
complementary single-stranded molecule (template) produced from the previous step, 
as described above. 

After this hybridization step, or concurrenUy with the hybridization step, the 
temperature is adjusted to a tempwature that is effective to promote the activity of the 

25 thermostable enzyme to enable synrfieisis of a primer extension product using as a 
template both the newly synthesized and the original strands. The temperature again 
must not be so high as to separate (denature) tiie extension product from its template, as 
described above. Hybridization may occur during this step, so that die previous step of 
cooling after denaturation is not required. In such a case, using simultaneous steps, the 

30 preferred temperature range is 50-70'C. 

The heating and cooling steps involved in one cycle of strand separation, 
hybridization, and extension product synthesis can be repeated as often as needed to 
produce the .desired quantity of the specific nucleic acid sequence. The only limitation 
is die amount of the primers, thermostable enzyme, and nucleotide triphosphates 

35 present Usiially, from 15 to 30 cycles are completed. For diagnostic detection of 
amplified DNA, the number of cycles will depend on the nature of the sample. For 
example, fewer cycles will be required if the sample being amplified is pure. If tiie 
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sanq)le is a complex mixture of nucleic acids, more cycles will be required to amplify 
the signal sufBciently for detection. For general amplification and detection, the 
process is repeated about 15 times. When amplification is used to generate sequences 
to be detected with labeled sequence-specific probes and when human genomic DNA is 

5 the target of amplification, the process is repeated 15 to 30 times to amplify the 
sequence sufficiently that a clearly detectable signal is produced. i.e., so that 
background noise does not interfere with detection. 

No additional nucleotides, primers, or themiostable enzyme need be added after 
the initial addition, provided that no key reagent has been exhausted and that the 

10 enzyme has not become denatmed or inactivated inevcrsibly, in which case additional 
polymerase or other reagent would have to be added for die reaction to continue. 
Addition of such materials at each step, however, will not adversely affect the reaction. 
After the appropriate number of cycles have been completed to produce the desired 
amount of the specific nucleic acid sequence, the reaction may be halted in the usual 

15 manner, e.g., by inactivating the enzyme by adding EDTA, phenol, SDS, or CHCI3 or 
by separating the components of the reaction. 

The amplification process may be conducted continuously. In one embodiment 
of an automated process, the reaction mixture may be temperature cycled such that the 
temperature is programmed to be controlled at a certain level for a certain time. One 

on such instniment for this purpose is the automated machine for handling the 

amplification reaction developed and marked by Peridn-Elmer Cetus Instruments. 
Detailed instructions for carrying out PGR with the instrament are available upon 
purchase of the instrument. 

Tth DNA polymerase is very useful in carrying out the diverse processes in 

25 which amplification of a nucleic acid sequence by the polymerase chain reaction is 
useful. The amplification method may be utilized to clone a particular nucleic acid 
sequence for insertion into a suitable expression vector, as described in U.S. Patent 
No. 4,800,159. The vector may be used to transform an appropriate host organism to 
produce the gene product of the sequence by standard methods of recombinant DNA 

3Q technology. Such cloning may involve direct ligation into a vector using blunt-end 
ligation, or use of restriction enzymes to cleave at sites contained within the primers. 
Other processes suitable for Tth polymerase include those described in U.S. Patent 
Nos. 4,683,194; 4,683,195; and 4,683,202 and European Patent Publication Nos. 
229,701; 237,362; and 258,017; these patents and publications are incorporated herein 

35 by reference. In addition, the present enzyme is usefiil in asymmetric PCR (see 
; Gyllensten and Erlich, 1988, Proc . Natl . Acad . Sfii. USA £5:7652-7656, incorporated 
herein by reference); inverse PCR (Ochman £t 1988, Generics 120:621. 
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incoiporatcd herein by reference); and for DNA sequencing (sec Innis fil 1 988, 
jEtoc. JM. Acad. USA 55:9436-9440. and McConlogue £i al., 1988. Muc- Acids 
BfiS. i^(20):9869). Tth polymerase also has reverse transcriptase activity. 

The following examples are offered by way of illustration only are by no means 
5 intended to limit the scope of the claimed invention, in these examples, all percentages 
are by weight if for solids and by volume if for liquids, unless otherwise noted, and all 
temperatures are given in degrees Celsius. 

Example 1 
Purification of Thenniis thermnphiln^: 

10 DNA PoWnia.. 

This example describes die isolation of Tth DNA polymerase from Thermus 
thermophilus. Tth DNA polymerase was assayed at various points during purification 
according to the method described for Taq polymerase in Lawyer £i al., 1989, J. Biol . 
Chfim. 2M(ll):6427-6437, incorporated herein by reference. 

15 Typically, diis assay is performed in 50 nl of a reaction mixmre composed of 

25 raM TAPS-HCl, pH 9.5 (20'C); 50 mM KCl; 2 mM MgCl2; 1 mM (3- 
mercaptoethanol; 200 jiM in each of dATP, dGTP, and TTP; 100 a-32p-dCTP 
(0.03 to.0.07 ^iCi/nmol); 12.5 ^g of activated salmon sperm DNA; and polymerase. 
The reaction is initiated by addition of poly-mcrasc in diluent (diluent is composed of 10 

20 mM Tris-HCl, pH 8.0, 50 mM KCl, 0.1 mM EDTA, 1 mg/ml autoclaved gelatin, 0.5% 
NP40, 0.5% Tween 20, and 1 mM p-mercaptoethanol), and the reaction is carried out 
at 74*C. After a 10 minute incubation, the reaction is stopped by adding 1 0 |il of 60 
mM EDTA. The reaction mixture is centrifiiged. and 50 pi of reaction mixture is 
transferred to 1.0 ml of 50 pg/ml carrier DNA in 2 mM EDTA (at O'C). An equal 

25 volume (1 ml) of 20% TCA. 2% sodium pyrophosphate is added and mixed. The 
mixture is incubated at O'C: for 15 to 20 minutes and then filtered through Whatman 
GF/C filters and extensively washed (6 x 5 ml) with a cold mixture containing 5% TCA 
and 1 % pyrophosphate, foUowed'by a cold 95% ethanol wash. The filters are then 
dried and the radioactivity counted. Background (minus enzyme) is usually 0.001% to 

30 0.01 % of input cpm. About 50 to 250 pmol of 32p-dCTP standard is spotted for unit 
calculation. One unit is equal to 10 nmpl of dNTP incorporated in 30 minutes at 74*C. 
Units arc calculated as follows. 



wo 91/09950 



PCr/US90/07639 



34 

sample q^m - enzvme dil. cpm = pmol incorporated 
specific activity of dCTP (cpm^Jniol) 

pmol incorpor ated x 3 x dilution factor x 4 = units/ml 
4.167x10 

5 Enzyme activity is not completely linear with time. With purified enzyme, a thirty 
minute assay is usually 2.S X a 10 minute assay. 

About 202 g of frozen Thermus thermophilus strain HB8 cells (ATCC No. 
27,634) were tiiawed in 100 ml of 3X TE-DTT buffer (150 mM Tris-Cl, pH 7.5, 3 
mM EDTA, and 3 mM dithiothreitol) containing 2.4 mM PMSF (from 144 mM stock in 
10 "DMF) and homogenized at low speed in a blender. All operations were earned out at 0 
to 4*C unless otherwise stated. All glassware was baked prior to use, and solutions 
used in the purification were autoclaved, if possible, prior to use. The thawed cells 
were lysed in an Aminco French pressure cell (18,000 psi), then diluted with an equal 
volume of IX TE-DTT buffer containing 2.4 mM PMSF and sonicated to reduce 

15 viscosity (1/3 aliquots, 80% output, 10 minutes, 50% duty cycle). The lysate was 
diluted with additional IX TE-DTT buffer containing fresh 2.4 mM PMSF to final 
5.5X cell wet weight. The resulting fraction, fraction I (1,100 ml), contained 15.6 g of 
protein and 46.8 x 10* units of activity. 

Ammonium sulfate was added to 0.2 M (29.07 g) and tiie lysate stirred for 30 

20 minutes on ice. Upon the addition of the ammonium sulfate, a precipitate formed 
which was not iemoved prior to the PEI precipitation step, described below. 
Amraoniuin sulfate prevents the Tth polymerase from binding to DNA in the crude 
lysate and reduces ionic interactions of the DNA polymerase with other cell lysate 
proteins. Speed in tiie initial steps of purification (i.e., up to loading onto and eluting 

25 from the phenyl-sepharose column) and the presence of protease inhibitor (PMSF at 
2.4 mM) are important for protection from proteolytic degradation of the DNA 
polymerase. For best results, tiien, one proceeds directly to tiie Polymin P (purchased 
from BDH) precipitation step to remove most nucleic acids rather tiian introducing a 
centrifiigation step to remove the precipitate that forms upon the addition of ammonium 

30 sulfate. For tiie same reason, one can include in fraction n the soft, viscous pellet that 
forms on top of tiie Polymin P/ammonium sulfate pellet, because tiie viscous pellet 
does not contain nucleic acids. Agarose gel electrophoresis and etiiidium bromide 
staining of tiie Polymin P supernatant indicates tiiat >9p% of tiie macromolecular DNA 
and RNA is removed by 0.2% Polymin P. To account for tiie additional amount of 

35 protein, when tiie viscous pellet is included, tiie phenyl sepharose column should tiien 
be ~10% larger than described below. 
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Empirical testing showed that 0.2% Polymin P (polyethyleneimine, PEI) 
precipitates ^90% of the total nucleic acid. Polyniin P (pH 7.5) was added slowly to 
0.2% (22 ml of 10% PED and the sluriy stined one hour on ice, then centrifuged at 
30.000xg at 4'C for 45 minutes. A soft, viscous peUet foimed on top of the PEI pellet, 

5 requiring additional centrifogation after 920 nil of the supernatant was decanted. The 
viscous material was centrifuged for one hour at 186.000xg at 2'C and yielded an 

additional 40 ml of supernatant and veiy large gelatinous pellets. These pellets 
contained <2% of the activity present in fraction I and 1.96 g of protein or 12.5% of 
fraction I. TTie supematants were pooled (fraction n, 960 ml) and contained 10.5 g 

10 protein and 42.6 x 10* units of activity. 

Fraction H was loaded onto a 3.2 x 6.5 cm (52 ml) phenyl sepharose CL-4B 
(I^t MI 02547, purchased from Phairaacia-LKB) column (equilibrated in TE 
containing 0.2 M ammonium sulfate and 0.5 mM DTT) at 80 ml/hr (10 m]/cm2/hr). All 
resins were equilibrated and recycled according to the manufacturer's 

15 recommendations. The column was washed with 240 ml of the same buffer (A2S0 to 
baseline), tiien with 220 ml TE containing 0.5 mM DTT (no ammonium sulfate) to 
remove non-Tth DNA polymerase proteins. The column was then washed with 270 ml 
of 20% ethylene glycol in IE containing 0.5 mM DTT to remove more contaminating 
protein, and the Tth polymerase activity was eluted with 2 M urea in TE containing 

20 20% ethylene glycol and 0.5 mM DTT. The fractions (5 ml) containing the polymerase 
activity were pooled (fraction Hla, 84 ml). The routine activity assays of the flow- 
through and wash fractions revealed that only -50% of the applied poiymcrdSc activity 
had bound when die capacity of die column was exceeded. To avoid exceeding die 
capacity of the column, a larger column (with, for example, at least 2X as much phenyl 

25 sepharose) should be used. The flow-dirough and wash fractions containing the 

balance of the activity were pooled (fraction lib, 685 ml), adjusted to 0.2 M ammonium 
sulfate, and dien reapplied to the same column after the column had been recycled and 
reequilibrated. 

Assays of low levels of Tth DNA polymerase activity in fractions containing 
30 Polymiri P (e.g., phenyl sepharose flow-tfirough fractions) should be conducted in the 
Fesence and absence of 10 mM EDTA. The presence of EDTA permits coirecrion for 
elevated background levels of radioactivity due to Polymin P binding of the nucieotide 
triphosphate substrate. 

As noted above, die Tth polymerase activity was eluted with a 2 M urea step 
35 (fraction ma). The eluant was dialyzed into heparin sepharose loading buffer to avoid 
prolonged exposure to urea (to avoid carijamylation) while waiting for the unretained 
fraction nb to be rerun over die same phenyl sepharose column. The dialyzed fraction 
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nia contained 42% of the applied activity (179;il3 units) and about 3,5% of the 
applied protein (351 mg), yielding a 12-fold purification. The pooled flow-through and 
0.2 M aimoonium sulfate wash fractions containing the unbound Tth DNA polymerase 
(fraction nb) consisted of 42.6% of the applied activity (181,559 units) and 40.8% of 

5 the applied protein (4,1 10 mg). The column was recycled as recommended by the 
manufacturer, reequilibrated with the starting buffer, and fraction lib was reapplied. 

Fraction lib was loaded onto the phenyl scpharose column at 78 ml/hr. The 
column was washed wiUi 270 ml of 0.2 M ammonium sulfate in TE containing 0.5 mM 
DTT, then with 170 ml TE contaiiung 0.5 mM DTT (no ammonium sulfate), and finally 
10 witii 260 ml of 20% ethylene glycol in TE containing 0.5 mM DTT, The Tth 

polymerase activity was again eluted with 2 M urea in TE containing 20% ethylene 
glycol and 0.5 mM DTT. The fractions (4.3 ml) containing tiic polymerase activity 
were pooled (fraction nib). ITie 2 M urea eluatc (fraction Illb) contained 87,6% of the 
applied activity (159,132 units) and 8.8% of the applied protein (363 mg), yielding a 
15 9.7 fold purification. 

Fraction lUb (1 1 6.4 ml) was adjusted to 0. 1 5 M KCl and pooled with fraction 
nia, which had been dialyzed without loss of activity into a buffer composed of 50 mM 
Tris-Cl, pH 7,5, 0,1 mM EDTA, 0.2% Twecn 20, 0.5 mM DTT, and 0,15 M KCl and 
stored at 4'C. The pooled fraction HI (243 ml) contained substantial levels of 

20 contaminating specific and non-specific Tth cndonucleases and exonucleases. The 
combined fraction EI contained 326,009 units of activity and 705 mg protein. 

Fraction III was loaded onto a 2.2 x 12 cm (45 ml) heparin sepharose CL-6B 
(purchased from Pharmacia-LKB) column and equilibrated in 0,15 M KCl, 50 mM Tris- 
Cl, pH 7.5, 0.1 mM EDTA, 0,2% Tween 20, and 0.5 mM DTT) at 45 ml/hr. All of 

25 the applied activity was retained by the column. The column was washed with 175 ml 
of tiie same buffer (A28O to baseline) and eluted with 670 ml of a linear 150-750 mM 
KCl gradient in the same buffer. Fractions (5.25 ml) eluting between 0.31 and 0,355 
M KCl were pooled (fraction IV, 149 ml). Similar to Taq DNA polymerase, which 
elutes with a peak at 0,3 1 M KCl, Ttii DNA polymerase elutcs with a peak at 0.33 M 

30 KQ contaminated with the coeluting TthHBSI endonuclease (an isoschizomer of 3^1 
endonuclease [TCGA]). 

Fraction IV was concentrated - 10-fold on an Amicon YM30 membrane and 
subsequentiy dialyzed against 25 mM Tris-Cl, pH 7.5, 0.1 mM EDTA, 0.2% Tween 
20, 0.5 mivl DTT, and 100 mM KCL A precipitate formed during dialysis and was 

35 removed by centiifugation (10 minutes at 12,000xg, 4*C) witfiout loss of activity. 
These steps, including the heparin sepharose column, yielded a 27-fold purification, 
with 95% of the activity applied to tiie heparin-sepharose column being recovered. 
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Although Tth DNA polymerase shares 88% sequence identity (93% similarity) 
with Taq DNA polymerase, the -10% difference in the two proteins changes their 
purification properties on phosphocellulosc significantly. In contrast to Taq DNA 
polymetase, which, when run in pH 7.5 Tris buffer, elutes at 0.2 M KCl from 
phosphocellulosc with its contaminating endonuclease eluting at -0.6-0.8 M KCl, Tih 
DNA polymerase and endonuclease cannot be easily separated on phosphocellulose. 
Tth DNA polymerase elutes with a peak at -0.45 M KQ and the Tth endonuclease peak 
is at 0.58 M KQ. Affigel-blue (Biorad Laboratories), however, is a useful resin for 
separating Tth endonuclease from Tth DNA polymerase. Affigel blue is a dye-Ugand 
resin used for affinity purification of enzymes with binding sites for nucleotides. 

The supernatant torn centrifijgation effraction IV (16.8 ml) was loaded onto a 
1.6 X 10 cm (20 ml) affigel-blue column (equilibrated in 25 mM Tris-Cl, pH 7.5, 0.1 
mM EDTA. 0.2% Tween 20. 0.5 mM DTT, and 100 mM KCl) at 20 ml/hr. AU of the 
applied Tth DNA polymerase activity bound to tiie resin. The column was washed with 
30 ml of the same buffer (A280 to baseline) and eluted with a 300 ml linear 0.1-0.5 M 
KCl gradient in the same buffer. Fractions (3.05 ml) eluting between 0.28 and 0.455 
M KCl were assayed to ensure absence of contaminating double- and single-strand 
endonuclease, indicated by absence of both lower molecular weight specific or non- 
specific DNA fragments after one hour or eleven hours incubation at 60'C with 5-20 
units of Tth polymerase activity using 600 ng of plasmid pLSGl covalenily-closed 
circular DNA or 850 ng of M13mpl8 SS-DNA. When the KCl gradient was applied, 
the Tth polymerase eluted with a fairly broad peak at -0.35 M KCl, while the 
endonuclease seemed to elute at >0.5 M KCl. Washing die affigel-blue column with 
0.15 M KCl and eluting with a linear 0. 15-0.6 M KCl gradient may provide better 
separation. 

Based on die SDS-PAGE pattern, two pools were made: fraction Va from peak 
fractions (61 ml) and fraction Vb, from flanking fiactions (72.5 ml). Fraction Va 
contained 22.2 x 104 units of activity and 5.5 mg of protein, and fraction Vb contained 
5.2 X 10* units of activity and 3.5 mg of protein. Both pools were concentrated 
separately by diafiltration on YM30 membranes. Fraction Vb was concentrated -10- 
fold on an Amicon YM30 membrane, then dialyzed into CM-Trisacryl buffer (25 mM 
sodium acetate buffer, pH 5.0, 0.5 mM DTT, 0.1 mM EDTA, and 0.2% Tween 20) 
containing 50 mM NaCl. Again, a precipitate formed during dialysis and was removed 
by centrifugation (12,000xg for 10 minutes at 4'C) resulting in a minor (<2%) loss of 
activity and a 1.4-fold purification. The resulting supernatant (8.6 ml, 5.1 x 10* units 
of activity and 2.3 mg of protein) was loaded onto a 1 x 3.8 cm (3 ml) CM-Trisaciyl 
column (equiUbrated in CM-Trisaciyl buffer and 50 mM NaQ) at 3 ml/hr. All of the 
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applied activity was retained by the column. The column was washed with 17 ml of the 
same buffer and eluted with SO inl of a steep, linear 0.05-0.7 M NaCl gradient in the 
same buffer. Fractions (1 ml) elutihg between 0.175 and 0.25 M were analyzed by 
SDS-PAGE electrophoresis prior to being pooled with fraction Va. The Ttfi DNA 

5 polymerase activity eluted with a sharp peak at 0.21 M NaQ. Judged by SDS-page of 
the gradient fractions, the polymerase was significandy enriched but still contained 
major contaminating bands at -35 kDa, -25 kDa, and -1 8 kDa. The resulting fraction 
V ( 11 .4 ml), which contained fraction Va and tiie peak fractions from the CM-Trisacyl 
column treatment of Fraction Vb, was dialyzed into CM-Trisacryl buffer containing 50 
10 mM NaCl. More precipitate fonned and was removed by centrifugation (10 minutes at 
12,000xg, 4*C) witii insignificant loss of activity. The precipitate contained 0.91 mg 
protein (-20%) and 2,227 units of activity (<1 %). 

The resulting supernatant (12.8 ml, containing 5.18 mg protein and 24.8 x 10^ 
units of activity) was loaded onto a 1.6 x 6.0 cm (12 ml) CM-Trisacryl (purchased 

15 fiiom Pharmacia-LKB) column (equilibrated in CM-Trisacryl buffer containing 50 mM 
NaCl) at 12 ml/hr. The column was washed with 20 ml of the same buffer containing 
50 mM NaCl, tiien with 27 ml of the same buffer containing 100 mM NaCl. No 
detectable polymerase activity appeared in the flow-through fractions. A technical 
problem (column adaptor broke) led to the immediate elution (in 400 mM NaQ) of the 

20 activity when the 100-400 mM NaQ linear gradient was applied. Seventy-eight percent 
of the applied activity (19.4 x 10* units and 4.09 mg protein) was recovered and 
reapplied to a CM-Trisacryl column of the same dimensions. 

The loading fraction (35 ml) was 2.7-fold diluted after readjusting tiie solution 
to 50 mM NaCl. The column was washed with 33 nil of die same buffer and eluted 

25 with a 180 ml linear 50-400 mM NaCl gradient in the same buffer. Fractions (1.4 ml) 
eluting between 0. 16 and 0.2 M NaCl were separately concentrated/diafiltered on 
Centricon 30 membranes in 2.5X storage buffer (50 mM Tris-Cl, pH 7.5, 250 mM 
KQ, 0.25 mM EDTA, 2.5 mM DTP, and 0.5% Tween 20 [Pierce, Surfact-Amps]). 
The Ttii DNA polymerase activity eluted with a peak at 0.183 M NaCl. slighdy earlier 

30 than was observed in the trial column. In comparison, Taq DNA polymerase elutes at 
0. 19-0.205 M NaCl when run on CM-Trisaciyl in the same pH 5.0 sodium acetate 
buffer. The concentrated and diafiltered samples were diluted with 1.5 volumes of 
80% glycerol (Fisher, spectral grade, autoclaved) and stored at -20'C until completion 
of tiie analysis of the individual fractions by SDS-PAGE. The fractions containing the 

35 Tth polymerase were of similar purity (-85-90%), as determined by SDS-PAGE gel 
electrophoresis. The major band migrates as a -90 kDA protein in tiiis gel system with 
minor contaminating bands. The discrepancy between this observed molecular weight 
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(-90 kDa) and the calculated molecular weight (-94 kDa, from the gene sequence) may 
simply be due to anomalous gel migration or to degradation during die purification 
process. The staining patterns of the individual fractions were similar enough to allow 
pooling of all of the fractions (fraction VI, 21 .5 ml). 
5 Fraction VI was funher concentrated/diafiltered on an Amicon YM30 membrane 

in 2.5X storage buffer. When die volume measured 7 ml, 0.2 ml were removed for 
amino acid composition and sequence analysis. The remaining 6.8 ml .were 
concentrated to 1.6 ml and diluted witii 2.4 ml of 80% glycerol. The resulting final 
pool (4 ml) contained 2.17 mg protein and 162,789 units of activity (34.8% yield) with 
10 a specific activity of 75,018 unitsAng protein. The results of each step of the 
purification are presented in tabular form below. 
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Example 2 

Qoning the Thermus thenmophilus Tth DN A Polvmera^P T fiene. 

This Examples desaibes the strategy and methodology for cloning the Tth DNA 
polymerase I (Tth Pol I) gene of Ihanms thermophilus. PCR-amplificd fiagments of 
5 the 1. aquatigUS DNA polymerase I (Taq Pol I) gene were used to probe genomic DNA 
blots to determine the restriction sites present in the Tth Pol I gene and flanking 
regions. PGR amplification of the Tth Pol I gene with Taq Pol I-specific primers 
provided even more restriction site and DNA sequence inforaiation about the Tth Pol I 
gene. This infonnation provided the basis for a two-step cloning procedure to isolate 
10 the Tth Pol I gene into plasmid pBS 13+ (mariceted by Stratagene; the plasmid is also 
known as BSM13+). 

At Preparation of Pmhft^ 

Four labeled probes were generated by PGR in the presence of biounylated 
dinP (biotin-1 1-dUTP, purchased from BeUiesda Research Laboratories) and 

15 Thgrmws aqtiatigHS DNA to probe southern blots of X thermophilus genomic DNA. 
Probe A was generated with primers CM07 and EK194 and encompasses 438 bp of the 
5' end of the Taq Pol I gene from nucleotide -230 to +207. Probe B was generated 
with primers MK138 and MK124 and encompasses 355 bp that span the Hindm site of 
tiie Taq Pol I gene and extsnd from nucleotide +555 to +879. Probe C was generated 

20 witii primers MK143 and MK 1 3 1 and encompasses 579 bp of tiie template-primer 
binding site coding sequence and the fiamHI site of tiie Taq Pol I gene from nucleotide 
+1313 to +1891. Probe D was generated witii primers MK130 and MK151 and 
encompasses 473 bp of tiie 3' end of tiie Taq Pol I gene from nucleotide +2108 to 
+3384. 

25 The sequences of tiie primers used to prepare tiie probes arc shown below: 

CM07 5'-GCGTGGCGGCGGAGGCGTTG 

EK194 5'.CrTGGCGTCAAAGACCACGATC 

MK124 5'-GGCCTTGGGGCTTrCCAGA 

MK130 5'-TGCGGGCCTGGATTGAGAAG 
30 MK131 5'-CCCGGATCAGGTTCTCGTC 

MK138 5'-GACCGGGGACGAGTCCGAC 

MK143 5'-CCGCTGTCCTGGCCCACATG 

MK151 5'-TTCGGCCCACCATGCCrGGT 
The sequence of tiie Taq Pol I gene is disclosed in Lawyer £t §1. and in U.S. Patent 
35 application Serial No. 143.441. filed January 12, 1988, botii incoiporated hercin by 
reference. 
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The probes were individually prepared in 100 jil of total reaction mixture 
composed of ,10 mM Tiis-HCl, pH 9.0 (the pH was set at nine to counteract the pH of 
the biotinylatcd dUTP in the reaction mixture; the biotinylated dUTP is in a buffer of 
100 mM Tris, pH 7,4), 50 mM KCl, 1-0 mM MgCh. 100 jig/ml gelatin, 2 U of Taq 

5 Pol I (marketed by Perkin-Elmer Cetus Instruments), 50 ^M dATP, 50 ^iM dCTP, 50 
' ^M dGTP, 37.5 ^lM TIP, 12.5 jiM biodn-1 1-dUTP, 50 pmol each primer and 
template DNA. The template DNA consisted of 1 pi of a 1 : 100 dilution of PGR 
products generated with the same primers in 25 cycles of a polymerase chain reaction in 
a reaction mixture composed of 10 mM Tris-HQ, pH 8.3; L5 mM MgCh; 200 \M 
10 each dNTP; no biotinylated dUTP; and 1.0 ng Taq genomic DNA boiled for three 

minutes and then quickly cooled on ice. PGR was performed in a Perkin-Elmer Getus 
Instruments Thermal Gycler. Probes and the template for probe generation were 
generated using 15 cycles of a 1 minute 45 second ramp to 98*G, 15 seconds at 98*G 
(in-tube temperature of 96.5'C), 45 second ramp to 55*G, 20 seconds at 55*G, 45 

15 second ramp to 72*G, and 30 seconds at 72*C There was a 5 minute soak at 72*C at 
the end of the last cycle. 

The genomic DNA hybridized to the probes was isolated as described in 
Lawyer £15^., and Southern blots were performed as described by Maniatis, except that 
MSI Magnagraph™ nylon membrane was used rather than nitrocellulose, and the DNA 

20 was fixed to the membrane widi U V light (in a UV Stratalinker™ 1 800, marketed by 
Stratagene) rather than heat. 

Blots were prehybridized at 42'C for 2 hours in a solution composed of 5X 
SSPE, 5X Denhardt's solution, 0,5% SDS, 5% dextran sulfate, 150 |ig/ml carrier 
DNA, and 50% foraiamide. Hybridization of probes to the blots was carried out 

25 overnight at 42'C in the same solution with probe present at approximately 10 ng/ml. 
After hybridization, the membranes were washed to remove unbound probe. 

Each of the four probes A-D hybridized to Themius thermophilus genomic 
DNA, A restriction site map of the Tth Pol I gene region of the genome was 
constructed by individually digesting and probing Southern blots of the digested Tth 

30 genomic DNA with restriction enzymes EslI, fiamHI, SacH, and A5E7 1 8. In addition , 
double digestions with Hindni/Asi2718, HindlllZEfilEn; HiadlH/fclhel; 
£anHVAffi718; MllHI/EsiEII; £amHI/5EllI; and 2amHI/13h£l of Tth genomic DNA 
followed by Southern blotting and probing of the digested DNA were performed. The 
resulting information allowed the construction of a restriction site map used in the 

35 cloningof the Ttfi Poll gene. 
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fi* PCR Amplification of the Primer - Templat e Binding Site Region of thR Tth Pni 

A series of PCR amplifications was canied out using Tth genomic DNA as 

5 template and primers homologous to IhsmMaauaticilS DNA in tiie region of the Taq 
Pol I gene that encodes the primer-template binding site sequence of Taq Pol I. Several 
primer pairs in various combinations were used in the amplifications, which were 
targeted to amplify various regions of the Tth DNA Pol I gene corresponding to the 
region fixHn nucleotide 293 to 1891 of tiie Taq Pol I gene. One primer pair, MK143 

10 and MK131, yielded product. 

. The an^lification reactions were carried out in a buffer composed of 10 mM 
Tris-HCl, pH 8.3, 50 mM KCl, 1 .5 mM MgCh. 200 \lM each dNTP, 2 U Taq Pol 1. 1 
ng heat-denatured Ttii genomic DNA, and 50 pmol of each primer. The amplifications 
were carried out for 25 cycles using the same thermocycler programming described 

15 above, and PCR products were analyzed via polyacrylamide gel electrophoresis. 

Most of the primers used in the unsuccessful amplifications either had many 
mismatches when later compared with the Tth Pol I gene sequence or had strategic 
mismatches at die 3' end of Uie primers. Primer MK143 had 3 mismatches to the Tth 
Pol I gene sequence but those mismatches were located at the 5' end of the primer and 

20 were followed by 15 bases of homology. Primer MK131 had 2 mismatches to the Tth 
Pol I gene, but tiie mismatches were located in the middle of the primer. 

The product oif tiie MK143/MK13 1 amplification of Ttii genomic DNA migrated 
on a polyacrylamide gel identically with tiie MK143/MK13 1 amplification product 
. using Taq genomic DNA as template. Restriction mapping of tiiese Taq and Tth 

25 amplification products show identical fiamHI, Sad , and Xhol restriction sites but 
different SacH and Esfl restriction sites. The Ttfi PCR product generated with primers 
MK143 and MK131 was funher amplified via asymmetric PCR witii tiie same primers 
and subjeaed to DNA sequence analysis in accordance witii the metiiods described in 
Gyllensten and Erlich, 1988, Eeqc. IM- Acad- Sfii- USA M(20):7652-7656; and Innis 

30 £ial., 1988, £db£. Nail. AM. Sfii. USA 55:9436-9440. 

Cloning the 5' End of the Tth Pol T Genp. 

From the restriction site map and sequence information generated by Uie 
SoutiiCTn blot and PCR analyses, a strategy for cloning tiie Tth Pol I gene in two steps 
was developed. An -3 kb Hindffl fragment of Ttii genomic DNA hybridized with 
35 probes A, B, and C but not D, indicating tiiat die fragment contains tiie 5' end of the 
Ttfi Pol I gene. TTus ~3 kb Hindin fitigment also contained a BamH I restriction site, 
which proved useful in cloning tiie 5' end of the gene. 
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To clone the 5' end of the Tth Pol I gene, a Hiadin digest of Tth genoniic DNA 
was size fractionated by electioelution on a 0.5 inch tube gel by collecting 250 ^1 
fractions every 5 minutes during electrophoresis as fragments of about 3 kb in size . 
were elutingfiran the gel. Dot blots with the probes described above identified the 

5 fractions containing the restriction fragments of interest The fractionated DNA of 
interest was then digested with restriction enzyme £amHI and treated with calf-intestine 
alkaline phosphatase (CIAP). CIAP was purchased from Boehringer Mannheim and 
used as directed by the manufacturer. Restriction enzymes, £. coli DNA polymerase, 
and ligase enzymes used in these Examples can be purchased from manufacturers such 

10 as New England Biolabs, Boehringer Mannheim (ASC? 1 8), and Promega (C5ll451, an 
isoschizomer of AsuID and used as directed by the manufacturer. 

Plasmid pBS 13+ (purchased from Stratagene) was likewise digested with 
restriction enzymes Hindm and BamH I and then ligated with the BflEOHI digested, 
CIAP-trcated -3 kb Hindm fragment pool. The ligation mixture was used to transform 

15 E. £2li K12 strain DG 98 QM-l. filldAl, hsdRH, iacIQ, la£ZAM15, i2mC::TnlO, 
supE 44/F. lasIQ, la£ZAM15, proC+. available from the ATCC under accession 
number 39,768) in substantial accord with the procedure of Hanahan £i ai- The 
ampicillin resistant (AmpR) transformants were screened by failure to exhibit blue color 
on X-gal plates and by probe hybridization with the DNA of transformed cells (via 

20 replica plating and lysis of the replicated cells as described by Woods si sl-, 1982, 

Proc . Natl . Acad , gci. USA 22:5661) with 32p-iabeled (by kinase treatment with -^2?. 
ATP) primer MK143. One colony contained a plasmid, designated pBSM:Tth5'; in 
which the -2.5 kb Hindm -BamH I restriction fragment had ligated with the large 
Hindin -BamH I restriction fragment of plasmid pBS13+. 

■ ■ ! 

■ . ' I 

25 H Cloning the 3' End of the Tth Pol I Gene 

The 3' end of tiie Tth Pol I gene wxs inserted into plasmid pBSM;Tth5' to yield 
a vector, designated pBSM:Ttii,-tiiat contains the intact coding sequence of die Tth Pol I 
gene. The Soutiiem blot and DNA sequence information showed tiiat an ~1 2 kb 
BamH I fragment of Ttii genomic DNA could be digested with AazVl S to yield an -5.6 

30 kb fragment tiiat hybridized with Probe D (die fragment should also hybridize with 
Probe C). The information also showed tiiat die BamHI site used to create die -5.6 kb 
BamH I-As p7 18 restriction fragment was tiie same BaoiHI site used to create die -2. 5 
kb Hindin-BamHI restriction fragment in plasmid pBSM:Ttii5'. 

TUi genomic DNA was tiien digestwi to completion witfi restriction enzyme 

35 BamH I and size-fractionated as described above, except that fractions containing 
fragments of -12 kb in size were identified and collected. Fractions which hybridized 
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in a dot blot to biotinylated Probes D and C were pooled, digested with restriction 
enzyme Asp718, treated with CIAP, and ligated with BamH I-As p7 18 digested plasmid 
pBSM:Tth5'. The ligated DNA was transformed into £. cdi K12 strain DGlOl (M-h 
endAl. hsdRH, lacR lacZAMlS. BmC::TnlO). 
5 The AmpR transformants were screened as above with 32p.iabeled primer 

MK132 to identify several colonies that contained a plasmid, designated pBSMrTth, 
that contained the --5,6 kb fiamHI-ASipV 1 8 and -15 kb Hindm -BamH I fragments in 
the correct orientation to reconstruct an intact coding sequence of the Tth Pol I gene. 
The sequence of oliognucleoride MK132 perfectly matches the Tth Pol I gene sequence. 

10 Several colonies with plasmid DNA that hybridized to the probe and yielded the 
expected ftagments on restriction enzyme digestion were induced with IPTG, and 
Western blot analysis of protein samples from induced and uninduced colonics with 
Taq Pol I polyclonal antibody showed an IPTG inducible band the same size (-94 kDa) 
as Taq Pol L One such colony was deposited with the ATCC and can be obtained from - 

15 the ATCC under accession number ATCC 68195. When culturing the strain, one must 
maintain selective pressure (ampicillin) to prevent loss of plasmid DNA. ATCC 68 195 
can thus also be used to obtain untransformed DG 1 0 1 cells. 

Constrvigtion of Plasmid pLSQ2i 

20 The deletion of 3' noncoding ("downstream") sequences has been shown to 

enhance recombinant expression of Thcrmus DNA polymerase in E. coli . In 
pBSMrTih, double digestion with restriction enzymes BstEII and Kpn l followed by 
Klenow repair in the presence of all four dNTPs and ligation under dilute conditions to 
favor intramolecular ligation results in the deletion of 3' noncoding sequences of the 

25 Tth DNA Pol I gene. Restriction enzyme BsiEII cuts plasmid pBSM:Ttii in the 3' 
noncoding region of the Tth Pol I gene, and restriction enzyme Kpnl cuts in tiie 
polylinker region of the vector. 

This deletion was made, and the resulting plasmid was designated as plasmid 
pLSG21. The deletion protocol results in die regeneration of the EstiEII restriction site. 

30 However, plasmid pLSG21 does not drive increased levels of Tth Pol I expression 
when compared to the levels achieved in plasmid pBSMrTth-transformed £. coli host 
cells. 
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Exanyle4 

Construcrion of Plasmids pT^SG22. pLSG2^. and pLSfi24 
The Tth Pol I gene lacks convenient restriction sites at the 5' and 3' ends of the 
gene. Such restriction sites facilitate the construction of a wide variety of expression 
5 vectors. In addition, codons at the 5' end of the coding sequence are highly GC-rich, 
which may inhibit efficient translation initiation and expression in £. £2li- Site-directed 
mutagenesis with oligonucleotides can be, and has been, used to introduce a number of 
useful changes in tiie coding sequences and in the 5' and 3' noncoding regions of Uie 
Tth Pol I gene. 

10 Plasmid pBS 1 3+ derivatives, such as plasmid pBSMrTth, can be obtained in 

single-stranded form by the protocols described in Lawyer £i al. and by Stratagene, the 
commercial supplier of plasmid pB S 1 3+. To make single-stranded plasmid pBS 1 3+ or 
a single-stranded daivarive plasmid, a host cell transformed with the plasmid is 
infected with a helper phage (such as R408) and cultured under conditions that allow 

15 production of phage DNA. The phage DNA is then collected and comprises the desired 
single-stranded DNA and a small amount of helper phage DNA. The desired DNA can 
be purified to remove the helper phage DNA by separating the DNA based on size, i.e., 
by electroelutipn. 

For the constructions described below, a plasmid, designated pBSMAPvuII, 

20 proved useful. Plasmid pBSMAPvuH was generated by deletion of the 382 bp Pvul l 
fragment of plasmid pBS 13+. The site-specific mutagenesis protocols involved the 
following steps: (1) single-stranded plasmid pBSM:Tth (or other pBS 13+ single- 
stranded derivative) and double-sn^ded, PvuII digested plasmid pBSMAPvuII were 
annealed by boiling a 1 to 2.5 molar ratio of pBSM:Ttfi (or other plasmid pBS 1 3+ 

25 derivarive)/ipBSMAPvun for three minutes in Klenow salts and then incubating the 
resulting mixture at 65*C for 5 minutes; (2) kinased mutagenizing oligonucleotide was 
then annealed to the resulting gapped duplex at a molar ratio of 5 to 1 by heating the 
oligonucleotide to 95'C for 1 minute and tiien adding the oligonucleotide to the gapped 
duplex mixture held at 75'C; (3) tiie resulting mixttire was incubated at 75'C for 2 

30 minutes and then slowly cooled to room ten:5)eranjre; (4) this annealed mixture was 
tiien extended with Klenow enzyms in tiie presence of all four dNTPs (2j00 ^iM in each 
dNTP) for 15 minutes at 37*C witii tiie addition of Ugase and 40 jiM ATP to tiie 
reaction. The resulting mixture was used to transfofm £. £Qli K12 DGlOl . 

The AmpR transformants were screened by probing witii the appropriate 

35 screening primer. Colonies tiiat had plasmid DNA diat hybridized to tiie probe were 
expanded into 3 ml cultures in R66 media (0.6% beef extract, 0.6% yeast extract, 2% 
peptone, 0.5% NaCl, 40 raM KPO4, pH 7.2, 0.2% glucose, and 100 jig/ml 
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ampicillin), incubated at 37'C for eight hours, and then used to prepare plasmid DNA 
by die mediod of Bimboim and Doly. The resulting plasmid DNA was subjected to 
restriction en^oie and DNA sequence analysis to ensure that the desired plasmid was 
obtained. 

5 A* Constniction of Plasmid pLSG22 

EcqRV and figin restriction enzyme sites were introduced downstream of the 
- TGAstopcodonof die Tth Pol I gene coding sequence by the foregoing method using 
oligonucleotide DG122 to mutagenize plasmid pBSMrTih and oligonucleotide DG 123 
to identify the desired transforaiants by probe hybridization. These oligonucleotides 
10 are shown below: 

Egin ^RV 

DG122 5' CCTCTAAACGGCAJaATClGAIAICAACCCTTGGCGGAAAGC 3' 

DG123 5' CAGATCTGATATCAACCC 

The resulting plasmid was designated pLSG22. 

15 Construction of Plasmid lilSCm 

Plasmid pLSG22 was mutagenized to introduce BsiXI and Asel (CiiE45I) 
restriction sites at the ATG start codon of the coding sequence of the Tth Pol I gene. In 
addition, codons 2, 3, and 5-7 were altered to be more AT-rich without changing the 
amino acid sequence of the resulting protein. The mutagenizing oligonucleotide was 

20 DG189, depicted below: 

-— BfitXI 

AauH AaeT 

EiG189 5'CCGGCCTTTGGGTlCeMTAATGGTAACATAGCTCCCailMlTTGGGCCACCTGTCCCCG 
3' 

25 Tth TTCAAAGAGCGGAAGCATCGCCTCCAT 
Codon 9 8 7654321 

The resulting plasmid was designated pLSG23. Transformants harboring plasmid 

pLSG23 were identified by their AmpR phenotype and by hybridization with 

oligonucleotide DGl 18, which has the strucmre shown below: 

30 DG118 5" TGGTAACATAGCTTCCAT 3" 

£L Constriction of Plasmid plJiGld 

Plasmid pLSG22 was mutagenized to introduce BstXI and Ndel restriction sites 
at the ATG Stan codon of the coding sequence of die Ttii Pol I gene. In addition, 
codons 2, 3, and 5-7 were altered to be more AT-rich wiUiout changing the amino acid 
35 sequence of the encoding protein. The mutagenizing oligonucleotide was DG190, 
depicted below. 
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-aatxi 

DQ190 '^*rrrvirr'r''v'r(vu ^TrrGPJ^.' f'^Jir(SG'rPJ^^^ 
3' 

5 Tth TTCAAAGAGCGGAAGCATCGCCTCCAT 
Codon 9 8765 4 3 21 

The resulting plasmid was designated pLSG24. Transformants harboring plasmid 
pLSG24 were identified by their AmpK phenotype and by hybridization with 
oligonucleotide DG 118. 

j^o Examplg 5 

rnnsmictior. o f Plasmids pLSG27 and pLS28 

• A. rnnstnirrinn of Plasmid pBSM:TthAStuI/Hindni 

Plasmids pLSG27 and pLSG28 are Tth Pol I expression vectors that drive 
expression of a truncated form of Tth Pol I. The truncation is an -80 codon deletion 

15 &om the amino-terminal-encoding region of the coding sequence for Tth Pol I. To 
construct these vectors, plasmid pBSM:Tth5' was first digested to completion with 
restriction enzymes Slul and HindHI. The digested plasmid DNA was then treated with 
Klenow enzyme in the presence of all four dNTPs and reciicuiarized by ligation. This 
treatment deleted the 5' noncoding region Uuough codcm 78 (the SluI site spans codons 

20 77-79) of the Tth Pol I gene. Plasmid pBSM:Tth5' also lacks the 3" end of the Tth Pol 
I coding sequence. The resulting plasmid was designated pBSM:TtiiAStuI/HindIII. 

Construction of Plasmid pLSG25 

Plasmid pBSM:TthASnjI/Hindin was mutagenized with oligonucleotide 
DGI91 as described above to yield plasmid pLSG25. In plasmid pLSG25, the 
25 truncated Tth Pol I coding sequence is placed in position for expression from the la£ 
promoter. In addition, the lasZa coding sequence is deleted, and an AS£l restriction 
enzyme recognition site is placed at die ATG start of die truncated coding sequence. 
The DG191 mutagenizing linker has the following strucmre: 

DG191 5*- 

30 CCTCCCCGCCTrcTAGGCCATrAArrrGGTCTCCTGTGTGAAATTGTTATC-S* 

Transformants harboring plasmid pLSG25 were identified by their AmpR phenotype 
and by hybridization with oligonucleotide DG193, which has Uie following strucmre: 
DG193 5'-TTTGGTCTCCTGTGTG-3' 
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£^ Constnicrion of Plasmid pTi;n2fi 

Plasmid pLSG26 was constructed in the same manner as plasmid pLSG25, 
except that the mutagenizing linker was DG192 as opposed to DG191. DG192hasthe 
following structure: 

5 DGi92 5'- 

CCTCC(XGCCTroTAGGCCATAT(OTTXK3TCTCCTG^ 

Plasmid pLSG26 is identical to plasmid pLSG25, except that an Nde l. as opposed to 
AfiSi, restriction enzyme recognition site spans the ATG start codon of the truncated 
coding sequence. Transfonnants harboring plasmid pLSG26 were identified by their 
10 AmpR phenotype and by hybridization with oligonucleotide DG193. 

JL Final Construction of Plasmids pLSG27 and pIi;n2S 

As noted above, plasmid pBSM:Tth5' lacks the 3* end of the Tth Pol I coding 
sequence, so plasmids pLSG25 and pLSG26 also lack this sequence. To place this 3' 
end of the Tth Pol I coding sequence in plasmids pLSG25 and pLSG26 in the correct 

15 reading frame, each plasmid was digested to completion with restriction enzymes 
fiainHI and E^RI. The large EceRI-JBamHI fragment of plasmid pLSG25 was then 
ligated with the ~1 .2 kb BamHI-EcoRI restriction fragment of plasmid pLSG22 to yield 
plasmid pLSG27. The ~1 2 kb fiamHI-EccRI restriction fragment of plasmid pLSG22 
contains the 3' end of the Tth Pol I coding sequence. In a similar fashion, plasmid , 

20 pLSG26 was digested with restriction enzymes EamHI and EcqRI and ligated witli the 
-1.2 kbfiailHI-EcfiRI restriction fragment of plasmid pLSG22 to yield plasmid 
pLSG28. Both plasmids pLSG27 and pLSG28 drive low level expression in £. cqH of 
a truncated form of Tth Pol I with polymerase activity. 

Example 6 

25 Construction of Plasmids pLSG29 Through pLSG34 

Although the las promoter in plasmids pBSM:Tih, pLSG21 , pLSG22, 
pLSG23, pLSG24. pLSG27, and pLSG28 drives expression of Tth Pol I activity in E. 
£fili, one of skill in the art recognizes that utilization of a stronger promoter than the jag 
promoter might increase Tth Pol I expression levels. One well known, powerful 

30 prompter is the Pl promoter from phage X. In addition, higher expression levels and 
. more efficient production can be achieved by altering the ribosome-binding site, 
transcription termination sequences, and origin of rqjlication (ot associated elements) 
of the Tth Pol I expression vector. This example illustrates how such changes can be 
made by describing the construction of expression vectors in which the APl promoter 
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and either the bacteriophage T7 gene 10 or X gene N ribosome-binding site arc 
positioned for expression of Tth Pol I. 

A. rnnsmiction of Kvpression Vectors pDG160 and pDG161 

Plasmid pDG 1 60 is a XPl cloning and expression vector.that comprises the XPl 

5 promoter and gene N ribosome-binding site (see U.S. Patent No. 4,71 1,845, 
incorporated herein by refraence), a restriction site polylinker positioned so that 
sequences cloned into the polylinker can be expressed under the control of the XPi- 
Nrbs. and a transcription tenninator from tiie fiacillus thuringiensis delta-toxin gene 
(see U.S. Patent No. 4,666,848, incorporated herein by reference). Plasmid pDG160 

10 also carries a mutated RNAI I gene, which renders die plasmid temperature sensitive for 
copy number (see U.S. Patent No. 4,631,257, incorporated herein by reference). 

These elements act in concert to make plasmid pDGl60 a very useful and 
powerful expression vector. At 30-32'C, die copy number of the plasmid is low, and 
in an host cell that carries a temperature-sensitive X repressor gene, such as cI857, the 

15 Pl promoter does not function. At 37-4 1 'C, however, die copy number of the plasmid 
is 25-50-fold higher tiian at 30-32'C, and die cI857 repressor is inactivated, allowing 
the Pl promoter to function. Plasmid pDG160 also carries an ampicillin resistance 
(AmpR) marker. Plasmid pDG161 is identical to plasmid pDG160, except the AmpR 
marker is replaced with a TetR (tetracycline resistance) marker: 

20 So, plasmids pDG160 and pDG161 comprise die AmpR or TetR marker, the 

>J*L promoter, die gene N ribosome-binding site, a polylinker. die BT cry PRE (BT 
positive retroregulatory element, U.S. Patent No. 4.666,848) in a CoIEl cop" vector. 
These plasmids were constructed from previously described plasmids and die duplex 
syndietic oligonucleotide linkers DG31 and DG32. The DG31/32 duplex linker 

25 encodes a 5' Hindffl cohesive end followed by fiasi, ^SSH, KpnVAsc718, 2taVSmaI 
recognition sites and a 3' BamH l cohesive end. This duplex linker is shown below. 

Sad Heal Kpni Zmai 
DG31 ' -Bf^rTTATfiAnrTncATriaTArrrrGnG 

ATACTCGAGGTACCATGGGGCCCCTAG-5 • DG32 

30 This duplex linker and plasmid pFC54.t were used to construct plasmid pDG 1 60. 

Plasmid pFC54.t, a 5.96 kb plasmid described in U.S. patent No. 4.666,848, 
supra, and available in E- £2li K12 strain DG95 carrying die prophage XN7N53CI857 
SusPgo from die ATCC under accession number ATCC 39789, was digested with 
restriction eiizymes Hindin and BamHI. and die isolated vector fragment was ligated 

35 widi a 5-foId molar excess of nonphosphbrylated and annealed DG3 1/32 duplex. 
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Following ligation, the DNA was digested with 2flial (to inactivate die vector pFC54.t 
DNA fragment the linker replaces) and used to transform E- fioli K12 strain DG 1 1 6 
(ATCC 53,606) to ampicillin resistance. Colonies were screened by restriction enzyme 
digestion for loss of the des-ala-seri25 il.2 mutein sequence and acquisition of tiie 
5 DG3 1/32 polylinker sequence. The polylinker region in die plasmid, designated 
pDG160, of one AmpR transfonnant was sequenced to verify that the desired 
construction was obtained. 

Plasmid pAW740CHB (available in £. cdi strain K12 DG 1 16 from the ATCC 

under accession number ATCC 67,605), die source of a modified tetracycline 
10 resistance gene in which the BainHI and Hindm restriction sites were eUminated. and 
which contains the XPl promoter gene N ribosome-binding site, and BTq^ PRE in a 
ColEl cop«s vector was digested to completion witii restriction enzymes ffindin and 
BainHI and the 4.19 kb vector fragment purified by agarose gel elecffophoresis. The 
purified vector DNA fragment was ligated with a 5-fold molar excess of 
15 nonphosphorylated annealed DG3 1/32 duplex. E. £filiK12 strain DGl 16 was 

transfomied with a portion of die DNA, and TctR coldnies screened for presence of 4.2 
kb plasmids. Several transfomiants were fimher screened by DNA restriction enzyme 
digestion and by sequence analysis of die polylinker region by die Sanger mediod. 
Several transfoimants contained a plasmid with die desired sequence, and the plasmid 
20 was designated pDG161, 

Constnictio n of Expression Plasmids dDGI 64 Through nPGI SI 
To faciUtate construction of Tdi expression vectors and to increase the efficiency 
of translation initiation, plasmids pDG160 and pDG161 were altered to introduce 
changes in die XPl promoter and ribosome-binding site (RBS) region. In these 
25 alterations, plasmids pDG160 and pDG161 were digested with restriction enzymes 
EseMH and Sad and tiien Hgated witii short, syntfietic linkers to create plasmids in 
which the small EscMH-SacI restriction fragment of plasmid pDGl60 (or pDG161) 
was replaced widi die duplex linker. 

The duplex linkers used in diese constructions had different strucnires and 
30 properties. Duplex DG106/DG107 encodes die bacteriophage T7 gene 10 RBS and an 
lidfil restriction enzyme recognition site at die ATG start codon and has die stnicture: 

Nde l 

DG106 5'-CCGGAAGAAGGAGATATACAlATfiAGCT-3' 
DG107 3'-TTCTTCCTCTATATGTATAC-5' . 

35 Duplex DG108/DG1()9 encodes a modified T7 gene 10 RBS and an AssI restriction 

enzyme recognition site at die ATG stan codon and has die structure: 
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DG108 5'-CCGGAAGAAGGAGAAAAMiaaiGAGCT-3' 
DGl 09 3«-TTCTTCCTCTTTTTAATTAC-5V 

Duplex DGl lO/DGl 1 1 encodes the X Nrbs and an IWfil restriction enzyme recognition 

site at the ATG start codon and has the structure: 



DGllO 
DGlll 



Nde l 

5 ' -CCGGAGGAGAAAACAIM£AGCT-3 ' 
3 ' -TCCTCTTTTGTATAC- 5 ' 



10 



Duplex DGl 12/DG113 encodes the Nrbs and an ASfil restriction enzyme recognition 
site at the ATG start codon and has the stracture:. 

Ase l 

DG112 5'-CCGGAGGAGAAALmaiGAGCT-3' 
DG113 3»-TCCTCTTTTAATTAC-5' 

The duplexes and EsBMH-Satl-digested plasmids pDG 160 and pDG 1 6 1 were 
Ugated as shown in tabular form below to yield plasmids pDG164 through pDG171. 



15 



20 



pigftsted Vector 

pDG160 
pDG160 
pDG160 
pDG160 
pDG161 
pDG161 
pDG161 
pDG161 



DG106/DG107 
DG108/DG109 
DGl 10/DGl 11 
DG112/DG113 
DG106/DG107 
DG108/DG109 
DGl 10/DGl 11 
DG112/DG113 



Constructed 

pDG164 
pDG166 
pDG168 
pDG170 
pDG165 
pDG167 
pDG169 
pDG171 



25 These vectors, together with plasmids pDG160 and pDG161 , were also modified, prior 
to inserting the Tth Pol I gene coding sequence, to yield plasmids pDG172 tiirough 
pDG181. 

This modification resulted in the destniction of the ^51 ton) restriction 
enzyme recognition site in plasmids pDG160, pDG161, and pDG164 through 

30 pDG171. Many of the vectors of the invention comprise a CsC^SI site at the 5' end of 
tiie Ttii Pol I coding sequence. These Csii45I-deleted vectors serve as convenient 
vectors for cloning fiagments generated with restriction enzyme Csil45I or AsuU. This 
£se45I site is located in the coUciniMM gene of the plasmids and was deleted by 
digesting witii restriction enzyme C5E45I. treating the £sil45I-digested DNA with 

35 Klenow enzyme in tiie presence of all four dNTPs to obtain blunt-ended, double- 
stranded DNA. and lecircularizing die plasmid DNA by Ugation. The resulting 
plasmids. designated pDG172 through pDGl 81. are shown in tabular form below. 
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10 



StaitingPlasmd 

pDG160 
pDG161 
pDG164 
pDG165 
pDG166 
pDG167 
pDG168 
pDG169 
pDG170 
pDG171 



Designation After 
Csp4ST Site Removal 

pDG172 

pDG173 

pDG174 

pDG175 

pDG176 - 

pDG177 

pDG178 

pDG179 

pDGlSO 

pDG181 



Plasmids pDG172 through pDGlSl were then used to place the Tth Pol I gene of the 
present invention in frame for (expression under the control of the XPl promoter. 

15 Construction of Tth Pol I Expression Vectors pLSG29 Thro ugh pLSG36 

The Tth Pol I gene can be cloned into expression vectors pDG172 through 
pDGlSl to create Tth Pol I expression vectors. Several illustrative constructions are 
shown in tabular form below. 



Starting 

20 Elasmid 

pDG174 
pDG174 
pDG175 
pDG177 
25 pDG178 
pDG178 
pDG179 
pDG181 



Sourse of Tth Pol I 
Coding Sgqvignge 



Ndel -BamH I 
Ndel -Bamm 
Ndel -BamH I 
Asel -BamH I 
Ndel-Bamm 
Ndel -Bamm 
Ndel -Bamm 
Asel -Bamm 



Restiiction 
Restriction 
Restiicdon 
Restiiction 
Restriction 
Restriction 
Restriction 
Restriction 



lTagmentofpLSG24 
Fragment of pLSG28 
Fragment of pLSG24 
Fragment of pLSG23 
Fragment of pLSG24 
Fragment of pLSG28 
Fragment of pLSG24 
Fragment of pLSG23 



Tth Pol I 
Plasmid Expression 

pLSG31 
PLSG35 
pLSG32 
pLSG29 
pLSG33 
pLSG36 
pLSG34 
pLSG30 



Expression vectors pLSG29 through pLSG36 were ttansformed into E. K12 strain 
30 DG116 and cultured under conditions that allow for expression of Tth Pol I. All 
transformants yielded about die same amounts Of activity, although vectors with the 
• Nrbs may yield somewhat higher levels of activity tiian vectors with the T7rbs- The 
XPL promoter vectors also produced Ttli Pol I at levels at least an order of magnitude 
higher than the lac promoter expression vectors. 
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Example ? 

Synthesis of R ecombinant Tth Pol I Activity in E. coli 
E, coli K12 strain DGl 16 (ATCC 53,606) harboring Tth Pol I expression 
plasmids with the XPl pronwter was grown at 32*C in Bonncr-Vogcl minimal salts 
5 media containing 0.5% glucose, 10 Jig/ml thiamine, 0-25% (w/v) Difco casamino 
acidsi and an5)icillin (100 ^g/ml) or tetracycline (10 ^gAnl) as app Cells were 

grown to an Asoo of about 0-8 and shifted to 37*C to dercpress the XPl promoter 
(inactivation of cI857 repressor) and increase the copy number of the CoUEl cop" 
plasmid vector. After six to nine hours of growth at 37'C, aliquots of the cells were 
10 harvested, the cells centrifuged, and the pellets stored at -70*C. 

Alternatively, £• COli K12 strain KB2 (ATCC 53,075) harboring a Tth 
expression plasmid under the control of the USL promoter/operator can be grown for 
eight hours at 32*C in Bonner- Vogel minimal salts media containing 0.5% glucose, 5 
jig/ml tryptophan, 10 |ig/ml thiamine, 0.25% Difco casamino adds, and 100 |ig/ml 
15 ampicUlin or 10 |ig/ml tetracycline to an A6oo of 3.0, Cells were harvested as above. 

Cell pellets were rcsuspended to 5 to 10 O.D, unitsAnl in 50 mM Tris-Cl, pH 
7.5, 1 mM EDTA, 2.4 mM PMSF, and 0.5 ^g/ml leupeptin and lysed by sonication. 
Aliquots of the sonicated extracts were subjected to SDS-PAGE and analyzed by 
Coomassie staining and Western immunoblotting with rabbit polyclonal anti-Taq 
20 polymerase antibody. In addition, portions of the extracts were assayed in a high 
temperature (74*Q DNA polymerase assay. 

Western immunoblotting showed significant induction and synthesis of an 
approximately 94 kDa Tth DNA polymerase polypeptide in induced strains harboring 
Tth expression plasmids. Coomassie blue staining of SDS-PAGE-separated total cell 
25 protein revaled the presence of a new predominant protein at -94 kDa in these induced 
strains. Hnally, high temperature activity assays confirmed the significant level of 
recombinant Tth DNA polymerase synthesis in these E. £J2li strains. 

E?taniplg 8 

PCR with Tth DNA Polymerase 

3Q About 1.25 units of the Tth DNA polymerase purified in Example 1 were used 

to amplify rRNA encoding sequences ftom Ttli genomic DNA. The reaction volume 
was 50 ^il, and the reaction mbcturc contained 50 pmol of primer DG73, 105 to 10^ 
copies of tiie Tth genome (-2 x 105 copies of genome/ng DNA), 50 pmol of primer 
DG74, 200 jiM of each dNTP, 2 mM MgCh, 10 mM Tris-HCl, pH 8,3, 50 mM KCl, 

35 and 100 lig/nH gelatin (although gelatin can be omitted). 
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The reaction was carded out on a Peridn-Elmer Ceius Instruments DNA . 
Thennal Cycler. Twenty to 30 cycles of 96*C for 15 seconds; 50*C for 30 seconds, 
and 75*C for 30 seconds, were carried out At 20 cycles, the amplification product. 
(1 60 bp in size) could be faindy seen on an ethidium bromide stained gel, and at 30 
5 cycles, the product was readily visible (under UV light) on the ethidium bromide 
stained gel. 

Hie PCR may yield fewer non-specific products if fewer units (i.e., 0.3 1 U/50 
Jil reaction) of Tth are used. In addition, the addition of a non-ionic detergent, such as 
laureth-12, to the reaction mixture to a final concentration of 1% can improve the yield 
10 of PCR product 

Primers DG73 and DG74 arc shown below: 

DG73. 5' TACGTTCCCGGGCCTTGTAC 3' 
DG74 5' AGGAGGTGATCCAACCGCA 3' 
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Tn theaairm 

1 . A recombinant DNA sequence that encodes Thermus thermophilus DNA 
polyermase I activity, 

1. The DNA sequence of Oaim 1 th*at can be isolated from plasmid 
pBSM:Tth. 

3 . The DNA sequence of Qaim 1 that encodes the amino add sequence, from 
amino to carboxy tcraiinus: 

MetGluAlaMetLeuProLeuPheGluProLysGlyArgValLeuLeuValAspGlyHis 

HisLexiAlaTyrArgThrPhePheAlaLeuLysGlyLeuThrThrSerArgGlyGluPro 

V.?.iGlhAlaValTyrGlyPheAlaLysSerLeuLeuLysAlaIieuLysGluAspGlyTyr 

LysAlaValPheValValPheAapAlaLysAlaProSerPheArgHisGluAlaTyrGlu 

AlaTyrLysAlaGlyArgAlaProThrProGluAspPheProArgGlnLeuAlaLeuIle 

LysGluLeuValAspLeuLeuGlyPheThrArgLeuGl'iValPrpGlyTyrGluAlaAsp 

AspValLeuAlaThrLeuAlaLysLysAlaGluLysGluGlyTyrGluValArglleLeu 

ThrAlaAspArgAspLeuTyrGlnLeuValSerAspArgValAlaValLeuHisProGlu 

GlyHisLeuIleThrProGluTrpLeuTrpGluLysTyrGlyLeuArgProGluGlnTrp 

ValAspPheArgAlaLeuValGlyAspProSerAspAsnLeuProGlyValLysGlylle 

GlyGluLysThrAlaLeuLysLeuLeuLysGluTrpGlySerLeuGluAsnLeuLeuLys . 

AsnLeuAspArgValLysProGluAsnValArgGluLysIleLysAlaHisLeuGluAsp 

LeuArgLeuSerLeuGluLeuSerArgValArgThrAspLeuProLeuGluValAspLeu 

AlaGlnGlyArgGluProAspArgGluGiyLeuArgAlaPheLeuGluArgLeuGluPhe 

GlySerLeuLeuHisGluPheGlyLeuLeuGluAlaProAlaProLeuGluGluAlaPro 

TrpProProProGluGlyAlaPheValGlyPheValLeuSerArgProGluProMetTrp 

AlaGluLeuLysAlaLeuAlaAlaCysArgAspGlyArgValHisArgAlaAlaAspPro 

LeuAJ.aGlyLeuLysAspLeuLysGluValArgGlyLeuLeuAlaLysAspLeuAlaVal 

LeuA.laSerArgGluGlyLeuAspLeuValProGlyAspAspProMetLeuLeuAlaTyr 

LeuLeiiAspProSerAsnThrThrProGluGlyValAlaArgArgTyrGlyGlyGluTrp 

ThrGluAspAlaAlaHisArgAlaLeuLeuSerGluArgLeuHisArgAsnLeuLeuLys 

ArgLeuGluGlyGluGluLysLeuLeuTrpLeuTyrHisGluValGluLysProLeuSer 

ArgValLeiiAlaHisMetGlioAlaThrGlyValArgLeuAspValAlaTyrLeuGlnAla 

LeuSerLeuGluLeuAlaGluGluIleArgArgLeuGluGluGluValPheArgLeuAla 

GlyHisProPheAsnLeuAsnSerArgAapGlnLeuGliiArgValLeuPheAspGluLeu 

ArgLeuProAlaLeuGlyLysThrGlnLysThrGlyLysArgSerThrSerAlaAlaVal 

LeuGluAlaLeuArgGluAlaHisProIleValGluLysIleLeuGlnHisArgGluLeu 

ThrLysLeuLysAsnThrTyrValAspProLeuProSerLeuValHisProArgThrGly 

ArgLeuHisThrArgPheAsnGlnThrAlaThrAlaThrGlyArgLeuSerSerSerAsp 

P r o AsnLeuGlnAsn I leP r oVa lArgTh r P r oLeuGlyGlnArg I le ArgArgAl aPhe 

Va 1 AlaGluAlaGlyT rpAl a LeuVa lAlaLeuAspTy r Se rGln I leGluLeuAr gVa 1 

LeuAlaHisLeuSerGlyAspGluAsnLeuIleArgValPheGlnGluGlyLysAspIle 

HisThrGlnThrAlaSerTrpMetPheGlyValProProGluAlaValAspProLeuMet 

ArgArgAlaAlaLysThrValAanPheGlyValLeuTyrGlyMetSerAlaHisArgLeu 

SerGlnGluLeuAlalleProTyrGluGluAlaValAlaPhelleGluArgTyrPheGln 
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SerPheProLysValArgAlaTrpIleGluLysThrLeuGluGluGlyArgLysArgGly 
TyrValGluThrLeuPheGlyArgArgArgTyrValProAspLeuAsnAlaArgValLys 
SerValArgGlijAlaAlaGluArgMetAlaPheAsnMetProValGlnGlyThfAlaAla 
AspLeuMetLysLeuAlaMetValLysLeuPl>eProArgLeuArgGluMetGlyAlaArg 
MetLeuLeuGlnValHisAapGluLeuLeuLeuGluAlaProGlnAlaArgAlaGluGlu 
ValAlaAlaLexiAlaLysGluAlaMetGluLyaAlaTyrProLeuAiaValProLeuGlu 
ValGluValGlyMetGlyGluAspTrpLeuSerAlaLysGly 



4. TheDNAseqenceofOaim Sthatis 

5'- ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG TCCTCCTGGT 
GGACGGCCAC CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA 
CCACGAGCCG GGGCGAACCG GTGCAGGCGG TCTACGGCTT CGCCAAGAGC 
CTCCTCAAGG CCCTGAAGGA GGACGGGTAC AAGGCCGTCT TCGTGGTCTT 
TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG GCCTACAAGG 
CGGGGAGGGC CCCGACCCCC GAGGACTTCC CCCGGCAGCT CGCCCTCATC 
AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG TCCCCGGCTA 
CGAGGCGGAC GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG 
GGTACGAGGT GCGCATCCTC ACCGCCGACC GCGACCTCTA CCAACTCGTC 
TCCGACCGCG TCGCCGTCCT CCACCCCGAG GGCCACCTCA TCACCCCGGA 
GTGGCTTTGG GAGAAGTACG GCCTCAGGCC GGAGCAGTGG GTGGACTTCC 
GCGCCCTCGT GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 
GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA GCCTGGAAAA 
CCTCCTCAAG AACCTGGACC GGGTAAAGCC AGAAAACGTC GGGGAGAAGA 
TCAAGGCCCA CCTGGAAGAC CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG 
CGCACCGACC TCCCCCTGGA GGTGGACCTC GCCCAGGGGC GGGAGCCCGA 
CCGGGAGGGG CTTAGGGCCT TCCTGGAGAG GCTGGAGTTC GGCAGCCTCG 
TCCACGAGTT CGGCCTCCTG GAGGCCCCCG CCCCCCTGGA GGAGGCCCCC 
TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT CCCGCCCCGA 
GCCCATGTGG GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG 
JJSJCCGGGC AGCAGACCCC TTGGCGGGGC TAAAGGACCT CAAGGAGGTC 
CGGGGCCTCC TCGCCAAGGA CCTCGCCGTC TTGGCCTCGA GGGAGGGGCT 
AGACCTCGTG CCCGGGGACG ACCCCATGCT CCTCGCCTAC CTCCTGGACC 
CCTCCAACAC CACCCCCGAG GGGGTGGCGC GGCGCTACGG GGGGGAGTGG 
ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC TCCATCGGAA 
CCTCCTTAAG CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG 
AGGTGGAAAA GCCCCTCTCC CGGGTCCTGG CCCACATGGA GGCCACCGGG 
GTACGGCTGG ACGTGGCCTA CCTTCAGGCC CTTTCCCTGG AGCTTGCGGA 
GGAGATCCGC CGCCTCGAGG AGGAGGTCTT CCGCTTGGCG GGCCACCCCT 
TCAACCTCAA CTCCCGGGAC CAGCTGGAAA GGGTGCTCTT TGACGAGCTT 
AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC GCTCCACCAG 
CGCCGCGGTG CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA 
TCCTCCAGCA CCGGGAGCTC ACCAAGCTCA AGAACACCTA CGTGGACCCC 
CTCCCAAGCC TCGTCCACCC GAGGACGGGC CGCCTCCACA CCCGCTTCAA 
CCAGACGGCC ACGGCCACGG GGAGGCTTAG TAGCTCCGAC CCCAJ^CCTGC 
AGAACATCCC CGTCCGCACC CCCTTGGGCC AGAGGATCCG CCGGGCCTTC 
GxGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA GCCAGATAGA 
GCTCCGCGTC CTCGCCCACC TCTCCGGGGA CGAAAACCTG ATCAGGGTCT 
TCCAGGAGGG GAAGGACATC CACACCCAGA CCGCAAGCTG GATGTTCGGC 
GTCCCCCCGG AGGCCGTGGA CCCCCTGATG CGCCGGGCGG CCAAGACGGT 
GAACTTCGGC GTCCTCTACG GCATGTCCGC CCATAGGCTC TCCCAGGAGC 
TTGCCATCCC CTACGAGGAG GCGGTGGCCT TTATAGAGCG CTACTTCCAA 
AGCTTCCCCA AGGTGCGGGC CTGGATAGAA AAGACCCTGG AGGAGGGGAG 
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GAAGCGGGGC TACGTGGAAA CCCTCTTCGG AAGAAGGCGC TACGTGCCCG 
ACCTCAACGC CCGGGTGAAG AGCGTCAGGG AGGCCGCGGA GCGCATGGCC 
TTCAACATGC CCGTCCAGGG CACCGCCGCC GACCTCATGA AGCTCGCCAT 
GGTGAAGCTC TTCCCCCGCC TCCGGGAGAT GGGGGCCCGC ATGCTCCTCC 
AGGTCCACGA CGAGCTCCTC CTGGAGGCCC CCCAAGCGCG GGCCGAGGAG 
GTGGCGGCTT TGGCCAAGGA . GGCCATGGAG AAGGCCTATC CCCTCGCCGT 
GCCCCTGGAG GTGGAGGTGG GGATGGGGGA GGACTGGCTT TCCGCC7A6G 
GTTAG-3' 

5. A recoQibinaiitDNA sequence that encodes a protein with thennostable 
DNA polymerase activity, said protein comprising a sequence of amino acids that has 
100% hcmiology to a contiguous sequence of at least five out of nine amino acids 
encoded by the Thermus thenmophilus DNA polymerase encoding sequence of Claim 
3, said contiguous sequence of nine amino acids selected from the group consisting of 
codons 238-246; 241-249. 335-343, 336-344, 337-345, 338-346. 339-347. 

6. A recombinant DNA sequence that encodes a protein with thermostable 
polymerase activity, said protein comprising a sequence of amino acids that has 100% 
homology to a contiguous sequence of at least four out of six amino acids encoded by 
the Thermus thermophilm DNA polymerase encoding sequence of Claim 3 at codons 
225-230. 

7 . A recombinant DNA vector that comprises the DNA sequence of Claim 1 . 

8. TherecombinantDNAsequenc«of Claim? selected from the group 
consisting of plasmids pBSM:Tth, pLSG21, pLSG22, pLSG23, pLSG24, pLSG27, 
pLSG28. pLSG29, pLSG30, pLSG31, pLSG32. pLSG33, pLSG34. pLSG35, and 
pLSG36. 

9. The recombinant DNA sequence of Claim 8 that is plasmidpBSM-.Tth. 

10. A recombinant DNA vector selected from the group consisting of plasmids 
pBSM:Tth5'. pBSM:TthAStuI/Hindm, pLSG25, and pLSG26. 

1 1 . A recombinant host cell transformed with a vector of Claim 7. 

1 2. The recombinant host cell of Claim 1 1 that is E £Qli- 
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13. Hie recombinant host cell of paim 12, tiansfonned with a vector selected 
from the group consisting of plasmids pBSM:Tth, pLSG21 , pLSG22, pLSG23, 
pLSG24, pLSG27, pLSG28. pLSG29, pLSGSO, pLSGS 1, pLSG32. pLSG33, 
pLSG34. pLSG35. and pLSG36. 

14. The recombinant host cell of Oaim 12 that is E. coli K12^BSM:Tth. 

15. A iriethod for pmifyingThernm thermophilus DNA polymerase I from 
7. /A«nR£>pMu5 cells, said method conq)rising: 

(a) pi^aiing a crude cell extract from said cells; 

(b) adjusting the ionic strength of said extract so that said polymerase 
dissociates from any nucleic acid in said extract; . 

(c) subjecting the extract to hydrophobic interaction chromatography; 

(d) subjecting the extract to DNA binding protein affinity chromatography; 

(e) subjectbg the extraa to nucleotide binding protein affinity chromatography; 

and 

(f) subjecting the extract to chromatography selected from the group consisting 
of anion exchange, cation exchange, and hydroxyapatite chromatography. 
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